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ORFxxxxx Annotation 



ORFQ1580 polysaccharide biosynthesis protein, putative 



ORF01612 conserved hypothetical protein 



ORFQ1613 glycosyl transferase, group 1 family protein 



ORF01617 conserved hypothetical protein 



ORFQ1618 conserved hypothetical protein 



ORF01621 glycosyl transferase, putative 



ORFQ1622 glycosyl transferase, group 2 family protein 



ORF01623 glycosyl transferase, family 8, degenerate 



ORF01624 IS1381, transposase OrfB 



IORFQ1625 IS1381, transposase OrfA 



1QRF01626 glycosyl transferase family 8 



1ORF01627 glycosyl transferase, family 8 



1ORF01628 conserved hypothetical protein 



QRF01630 cell wall surface anchor family protein 



[ORF01635 proteaseTputative 



[ORFQ1643 aminopeptidase PepS (pepS) 



ORF01702 peptidase, M2Q/M25/M40 family 



jORF01731 IS1 381, transposase OrfA 



IORF01732 1S1 381, transposase OrfB 



IORFQ1740 tellurite resistance protein TehB (tehB) 



ORF01747 methylated-DNA~protein-cysteine S-methyltransferase (ogt) 



[QRFQ1749 acetyltransferase, GNAT family 



1ORF01763 AcuB family protein __ 

1ORF 01764 branched-chain amino acid ABC transporter, ATP-binding protein (HvF) 



ORF01765 branched-chain amino acid ABC transporter, ATP-binding protein (livG) 
IORF01766 branched-chain amino acid ABC transporter, permease protein 
ORF01767 branched-chain amino acid ABC transporter, permease protein (HvH) 
loRF01769 branched-chain amino acid ABC transporter, amino acid-binding protein" 
ORFQ1775 aminotransferase, class I 
ORF01779 potassium uptake protein, Trk family 
[ORF01780 cation uptake protein, Trk family 
ORF01824 cobalt transport family protein 
1ORF01826 conserved hypothetical protein 

ORF01 832 peptidase, M20/M25/M40 family Z 
1ORF01845 conserved hypothetical protein 

IORF01848 transcriptional regulator, MerR family 

1ORF01853 isochorismatase family protein 
ORF01 859 membrane protein 

ORF01875 oxidoreductase, aldo/keto reductase family 

|ORF01880 phospho-2-dehydro-3-deoxyheptonate aldolase 



[ORF01981 rRNA (guanine-NI-)-methyltransferase, putative 



1ORF02Q83 prophage LambdaSa^DNA replication protein DnaC, putative 
QRF02101 Na+/H+ exchanger family protein 
ORF02107 membrane protein, putative 
|ORF02139 UDP-glucose 4-epimerase (galE) 



1ORF02143 lacX protein 



ORF02162 conserved hypothetical protein 



1ORF02186 hemolysin precursor, putative 



[ORF02192 transcriptional regulator, MerR family 



IORF02195 MutT/nudix family protein 



[ORF02228 1S1381, transposase OrfB 



1 ORF02229 IS1381, transposase OrfA 



QRFQ2233 conserved hypothetical protein 



ORF02234 conserved hypothetical protein 

loRF02276 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase (metE) 
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ORF02278 branched-chain amino acid transport protein AzlC, putative 

QRF02288 giycosyl transferase, family 8 

ORFQ2289 glycosyl transferase, family 8 _ . 

QRF02341 ribosomaf protein L32 (rpmF) a 

QRF02343 conserved hypothetical protein 

ORFQ2358 sensor histidine kinase 

ORF02369 conserved hypothetical protein 

ORF02384 LysM domain protein 

QRF02428 hypoxanthine-guanine phosphoribosyltransferase (hpt) 

QRF03011 ribosomal protein L33 

ORF03014 ribosomal protein L33 __ 
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QRFQQ064 rlbosomal protein S14, putative 

ORFQ0095 D-alanyl-D-alanine carboxypeptidase family protein 

ORF00096 N-acetylmuramoyl-L-alanine amidase, family 4 protein 

ORF001 10 conserved hypothetical protein 

QRFQQ1 12 DNA repair protein RadA (radA) 

ORF00124 permease, putative 

ORFQ0148 giycosyl transferase, group 4 family protein 

QRF00154 penicillin-binding protein 4, putative 

ORFQ0157 oligopeptide ABC transporter, permease protein 

ORF00206 oligopeptide ABC transporter, oligopeptide-binding protein 

ORFQ0207 oligopeptide ABC transporter, permease protein 

ORF002Q8 oligopeptide ABC transporter, permease protein 

ORF00209 peptide ABC transporter, ATP-binding protein 

QRFQ021Q peptide ABC transporter, ATP-binding protein 

ORF00216 IS1548, transposase 

ORF00226 conserved hypothetical protein 

QRFQ0232 conserved hypothetical protein 

ORF00239 site-specific recombinase, phage integrase family 

ORF00250 conserved hypothetical protein 

ORF00251 conserved hypothetical protein 

ORF00289 ABC transporter, ATP-binding protein 

ORF00305 NADH oxidase, putative 

ORFQ031 7 cell division protein FtsL, putative 

ORF00333 conserved hypothetical protein / 

ORF00383 hydrolase, haloacid dehalogenase-like family 

ORF00430 expressed putative lipoprotein 

ORFQ0431 transcriptional repressor CopY 

QRF0Q434 membrane protein, putative 

ORF00438 transcriptional regulator, Fur family 

ORFQ0442 membrane protein, putative 

ORFQ0445 bioY family protein 

ORF00446 AtsA/ElaC family protein 

ORF00468 expressed putative protease 

ORF00469 giycosyl transferase, group 2 family protein 

ORFQ0471 nrdl protein (nrdi) 

ORF00473 expressed protein of unknown function 

ORF00474 conserved hypothetical protein 

ORF00507 conserved hypothetical protein 

ORF00525 bioY family protein 

QRFQQ528 thiolase 

ORFQ0531 AMP-binding enzyme domain protein 

ORF00548 YGGT family protein 

QRF00565 exodeoxyribonuclease VII, small subunit (xseB) 

ORF0Q568 arginine repressor ArgR, putative 

ORF0Q572 expressed putative I ipase/acyl hydrolase 

ORF00573 conserved hypothetical protein 

ORF00586 iron-sulfur cluster-binding protein, putative 

ORF00592 oxidoreductase, short chain dehydrogenase/reductase family 

ORFQQ604 dipeptidase 

QRFQ061 1 voltage-gated chloride channel family protein 

ORF00619 prophage LambdaSal, repressor protein, putative 

ORF00622 conserved hypothetical protein 

QRFQ0627 prophage LambdaSal, antirepressor, putative 

ORF00634 conserved hypothetical protein 

ORF0Q648 conserved hypothetical protein 
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ORFxxxxx Annotation 

ORFQQ654 conserved hypothetical protein 

ORFQ0655 conserved hypothetical protein 

QRF00656 conserved hypothetical protein 

QRF00658 conserved hypothetical protein 

ORF00659 conserved hypothetical protein 

ORF00660 prophage LambdaSal, structural protein, putative 

ORFQ0662 conserved hypothetical protein 

ORF00663 conserved hypothetical protein 

ORF00664 conserved hypothetical protein 

ORF00665 conserved hypothetical protein 

ORF00666 prophage LambdaSal , structural protein 

ORF0Q668 conserved hypothetical protein 

ORF00669 prophage LambdaSal, pbIA protein, internal deletion 

ORF00677 prophage LambdaSal, lysin, putative 

QRF00679 conserved hypothetical protein 

ORFQQ695 transposase OrfB, 1S3 family, truncation 

ORF0Q697 conserved hypothetical protein 

ORF00707 conserved domain protein 

ORF00713 acid phosphatase precursor, class B 

QRFQ0720 transposase OrfB, 1S3 family FRAMESHIFT 

QRF00721 transposase OrfA, 1S3 family 

ORF00751 cylA protein (cylA) 

QRF0Q755 cyll protein (cyll) 

ORF00760 serine protease, subtilase family, putative POINT MUTATION 

ORF00781 transcriptional regulator, LysR family 

QRFQ0783 regulatory protein, putative 

ORF00785 1S1548, transposase 

ORF00786 regulatory protein, putative, truncation 

ORF00787 D-iactate dehydrogenase (IdhA) 

ORF00801 glycosyl transferase, group 1 family protein 

ORF00805 conserved hypothetical protein 

ORF00826 phage shock protein C, putative 

QRFQ0833 conserved hypothetical protein 

ORF00845 hydrolase, haloacid dehalogenase-like family 

ORF00852 conserved hypothetical protein 

ORFQ0853 expressed putative lipoprotein 

ORF00857 1S1548, transposase 

ORF00890 conserved hypothetical protein 

ORF009Q2 conserved hypothetical protein 

ORF00926 membrane protein, putative 

ORFQQ927 membrane protein, putative 

ORF00987 conserved hypothetical protein 

QRF010Q9 expressed protein of unknown function 

ORF01010 lipoyl-binding domain protein 

ORF01011 oxidoreductase, putative 

ORF01012 conserved hypothetical protein 

ORF01024 expressed putative lipoprotein 

ORF01Q61 signal peptidase I, putative 

ORF01Q64 IS1548, transposase 

ORF01084 glyoxylase family protein 

QRF01104 SatD 

ORF01126 conserved hypothetical protein 

ORF01191 conaerved hypothetical protein 

ORF01 1 92 conserved hypothetical protein 

ORF01193 glycine cleavage system H protein, putative --— — 
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QRFQ1 194 bacterial luciferase family protein 
QRF01 1 95 oxidoreductase, FMN-binding 
ORF01 197 lipoate-protein Hgase A family protein 
ORF012Q2 1S861 , transposase OrfA 

QRF01223 drug resistance transporter, EmrB/QacA family, putative 

QRFQ1224 conserved hypothetical protein 

ORF01225 potassium uptake protein, putative 

QRF01237 membrane protein, putative 

ORF01249 dihydroneopterin aldolase (folB) 

ORF01256 polysaccharide deacetylase family protein 

ORF01273 transcriptional regulator, GntR family/potassioum uptake protein, TrkA family 

QRF01280 conserved hypothetical protein 

ORFQ1281 conserved hypothetical protein 

QRF01289 lipoprotein, putative 

ORF01291 conserved hypothetical protein 

ORF01298 conserved hypothetical protein 

ORF01318 conserved hypothetical protein 

ORF01320 voltage-gated chloride channel family protein, putative 

ORF01322 arsenate reductase (arsC) 

ORF01340 dTDP-glucose 4,6-dehydratase (rfbB) 

ORF01341 dTDP-4-dehydrorhamnose 3,5-epimerase 
ORF01342 glucose-1 -phosphate thymidylyltransferase (rfbA) 

ORF01356 hypothetical protein 

ORF01368 conserved hypothetical protein 

QRF01374 ISSdyl , transposase OrfB 

ORF01388 transposase OrfA, 1S3 family 

ORF01389 transposase OrfB, IS3 family, truncation 

ORF01391 ISSdyl , transposase OrfB FRAMESH1FT 

ORF01396 transcriptional regulator, Cro/CI family 

ORF01419 repressor protein, putative 

ORF01461 amino acid permease 

ORF01469 conserved hypothetical protein 

ORFQ1483 sensor histidine kinase 

ORFQ1485 GTP pyrophosphokinase family protein 

ORF01490 S'-nucleotidase family protein 

QRF015Q9 2-dehydropantoate 2-reductase, putative 

QRF01510 regulatory protein, putative 

ORF01522 carbamoyl-phosphate synthase, large subunit, putative 

ORF01542 sulfatase 

ORF01549 conserved hypothetical protein 

QRFQ1554 iron compound ABC transporter, substrate-binding protein 

ORF01557 conserved hypothetical protein 

QRF01563 conserved hypothetical protein TIGRQ1212 

ORF01583 glycosyltransferase, group 2 family protein 

ORF01584 glycosyltransferase, group 2 family protein 

ORF01585 glycosyltransferase, putative 

ORF01586 dTDP-4-dehydrorhamnose reductase (rfbP) 

ORF01593 conserved hypothetical protein 

ORF01599 conserved hypothetical protein 

ORF01600 glycerol-3-phosphate transporter, putative 

ORF01639 conserved hypothetical protein 

ORFQ1650 nitroreductase family protein 

ORF01653 amino acid permease " ~ "~~ 

ORF01665 transcriptional regulator, MutR family 

QRF01683 MutT/nudix family protein 
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ORF01686 67 kDa Myosin-crossreactive streptococcal antigen 

QRF01688 peptide methionine sulfoxide reductase (msrA) 

QRFQ1694 peptide ABC transporter, permease protein 

ORF01704 conserved hypothetical protein ~ 

ORFQ1705 1S861, transposase OrfA 

ORF01741 membrane protein, putative 

QRF01770 conserved hypothetical protein ~ 

QRFQ1772 1S1548, transposase 

QRF01790 conserved hypothetical protein 
ORF01794 conserved hypothetical protein 

ORF0180Q amino acid ABC transporter, substrate-binding protein 

ORF01810 1S1548, transposase 

QRF01827 sodium:dicarboxyiate symporter family protein 

ORFQ1877 immunogenic secreted protein, putative 

QRFQ1913 transcriptional regulator, Cro/Cl family 

QRF01928 membrane protein, putative ~" 

ORF01931 transporter, putative 

ORF01932 transcriptional regulator, Crp/Fnr family 

QRFQ1947 transcriptional regulator, merR family 

QRF01970 acid phosphatase 

ORF02002 amino acid ABC transporter, permease protein 

QRF02028 perfringolysin O regulator protein (pfoR) 

QRFQ2029 conserved hypothetical protein 

ORF02031 expressed protein of unknown function 

ORF02032 expressed protein of unknown function 

ORF02035 deoxynucleoside kinase family protein 

ORF02042 alkyl hydroperoxide reductase, subunit C (ahpC) 

ORF02126 transcriptional regulator, MarR family 

ORF02128 N-acetylmuramoyl-L-alanine amidase, family 4 protein 

ORF02135 malate oxidoreductase 

ORF02136 citrate carrier protein, CCS family 

ORF02137 sensor histidine kinase family protein "~ 
ORF02138 response regulator 
ORF02166 conserved hypothetical protein 

ORF02169 PTS system, HB component 

QRF0217Q PTS system, HA component, putative 

ORF02202 ABC transporter, ATP-binding protein 

ORF02262 ABC transporter, ATP-binding protein 

ORF02270 cAMP factor (cfb) 

ORF0228Q serine protease, subtilase family, putative 

ORF02286 major facilitator family protein 

ORF02292 preprotein translocase, SecE subunit, putative 

QRFQ2295 Lyme disease proteins of unknown function, putative 

ORF02298 Na+ dependent nucleoside transporter 

ORF023Q1 transcriptional regulator, GntR family 

ORF02313 virulence factor MviM, putative 
ORF02316 membrane protein, putative 

QRFQ2319 conserved hypothetical protein TIGRQ0250 

ORF02328 transporter, putative 

ORF02331 cold shock protein, CSD family 

ORF02332 DNA mismatch repair protein HexA (hexA) 

QRF02335 conserved hypothetical protein 

ORF02372 conserved hypothetical protein 

ORF02383 expressed putative lipoprotein 

ORF02393 transporter, putative 
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ORF02398 transcriptional regulator, Crp/Fnr family 

ORF02399 conserved hypothetical protein 

ORF02401 acetyltransferase, GNAT family 

ORF02403 arginine/ornithine antiporter (arcD) 

ORF03002 conserved hypothetical protein, truncation v 



506 



WO 2004/018646 



PCT/US2003/026827 



Table 1 1 : GBS genes not shared with GAS or pneumococcus 

ORFxxxxx Annotation 

ORF00008 protease, putative 

QRFQQQ1Q acyl carrier protein (acpP) -— — — 

ORF00016 acetyltransferase, GNAT family ~ 

ORF00018 peptidase, M23/M37 family, putative secreted protein 

ORF00035 membrane protein, putative 

ORF0QQ87 lipoprotein, putative 

ORF0Q088 hypothetical protein 

ORF00089 hypothetical protein 

ORF00091 conserved hypothetical protein — 
ORF00117 ribose ABC transporter, periplasmic D-ribose-binding protein (rbsB) 
QRF00118 ribose ABC transporter, permease protein (rbsC) 

ORF00120 ribose ABC transporter protein RbsD (rbsD) 

ORFQ0121 ribokinase (rbsK) 

ORF00123 hypothetical protein 

ORF00130 argininosuccinate lyase (argH) 

ORF00137 conserved hypothetical protein 

ORF00138 hypothetical protein 

ORFQ0166 4-diphosphocytidyl"2C-methyl-D~erythritol kinase (ispE) 

QRF00182 conserved domain protein 

QRFQ0186 transcriptional regulator, Cro/Cl family 

ORF00187 hypothetical protein 

ORF00188 hypothetical protein 

ORF00192 hypothetical protein 

QRFQ0193 conserved hypothetical protein 

ORF00196 conserved hypothetical protein 

ORF00199 hydrolase, haloacid dehalogenase-like family 

ORF0Q200 sensor histidine kinase, putative 

ORF00201 response regulator 

ORF00203 conserved hypothetical protein 

ORFQ0204 membrane protein, putative 

ORF00205 hypothetical protein 

ORF0Q228 lipoprotein, putative 

ORF00234 hypothetical protein 

ORF00235 hypothetical protein 

ORF00238 hypothetical protein 

ORF00240 transcriptional regulator, Cro/Cl family 

QRF00241 hypothetical protein 

ORF00242 conserved hypothetical protein 

ORF00243 hypothetical protein 
ORF00244 conserved domain protein 

ORFQ0245 conserved hypothetical protein, fusion 

ORF00246 replication initiation protein, putative " 

QRF0Q247 hypothetical protein 

ORF00248 recombination protein 

ORFQ0249 hypothetical protein 

ORF00252 conserved hypothetical protein — — 

ORF00253 hypothetical protein 

QRF00254 hypothetical protein 

ORF0Q255 hypothetical protein 

QRF00256 hypothetical protein 

QRF00257 hypothetical protein 

ORF00258 hypothetical protein 
ORF00259 hypothetical protein 

ORF00260 hypothetical protein 

ORFQ0272 expressed putative lipoprotein ^^———.^^ 
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ORFQQ273 hypothetical protein 



ORF00274 hypothetical protein 



ORF00275 hypothetical protein 



QRFQ0276 hypothetical protein 



ORFQ0278 membrane protein, putative 



ORF0Q279 transcriptional regulator, Cro/CI family 



ORF00280 acetyltransferase, GNAT family 



ORF00281 acetyltransferase, GNAT family 



ORF00283 conserved hypothetical protein 



ORFQ0284 RNA polymerase sigma factor, ECF subfamily 



ORF00285 lipoprotein, putative 



QRF00287 transcriptional regulator, TetR family 



ORF00288 ABC transporter efflux protein, DrrB family, putative 



ORF00292 hypothetical protein 



ORF00294 expressed protein of unknown function 



ORF00298 acyl carrier protein phosphodiesterase, putative 



ORF00308 conserved hypothetical protein 



ORF00324 conserved hypothetical protein 



ORF00332 hypothetical protein 



ORF00340 hypothetical protein 



ORF00347 conserved hypothetical protein 



ORF00384 hypothetical protein 



ORF00402 membrane protein, putative 



ORFQ0408 hypothetical protein 



ORFQ0409 membrane protein, putative 



QRF00414 conserved hypothetical protein 



ORFQ0416 hypothetical protein 



ORFQ0417 hypothetical protein 



ORF00433 copper-transporter protein CopZ 



ORF00448 hypothetical protein 



ORF00466 conserved hypothetical protein 



ORF00467 acetyltransferase, GNAT family 



ORF00475 conserved domain protein 



ORF00476 hypothetical protein 



ORF00478 carboxymuconolactone decarboxylase family protein 



ORF00479 conserved hypothetical protein 



QRF00486 transcriptional regulator, AraC family 



ORF00487 surface protein Rib 



ORF00488 transposase, IS256 family, truncation 



ORF00489 DNA-damage-inducible protein J, putative 



QRF00490 hypothetical protein 



QRF00491 lipoprotein, putative 



ORF00493 bacteriophage L54a, integrase, truncation 



ORF0Q497 conserved domain protein 



QRF00503 oxidoreductase, Gfo/ldh/MocA family 



QRF00506 transposase, IS256 family 



QRF0Q510 bacteriocin transport accessory protein, putative 



QRF00512 hypothetical protein 



ORF00526 biotin synthetase (bioB) 



ORF00527 hypothetical protein 



DRF00533 type IV prepilin peptidase-related protein 



ORF00538 conserved hypothetical protein 



ORFQ0556 hypothetical protein 



ORF00563 expressed protein of unknown function 



ORF00575 hypothetical protein 
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ORF00584 conserved hypothetical protein 

ORF00585 fructose- 1 ,6-bisphosphatase, putative 

QRF00590 carboxymethylenebutenolidase-related protein 

ORF0Q597 conserved hypothetical protein 

QRF0Q598 inosine-uridine preferring nucleoside hydrolase 

ORF00599 hypothetical protein 

ORFQQ60Q OsmC/Ohr family protein 

ORF00608 adenosine deaminase, putative 

ORF00610 chorismate mutase, putative 

ORF0Q615 prophage LambdaSal, site-specific recombinase, phage integrase family 

ORF00617 conserved domain protein 

ORF00618 hypothetical protein 

ORF0Q620 hypothetical protein 
ORF0Q621 conserved hypothetical protein 

ORF0Q623 hypothetical protein 

ORFQ0624 hypothetical protein 

ORF00626 prophage LambdaSal, transcriptional regulator, Cro/CI family 

ORF0Q628 hypothetical protein 

ORF00630 hypothetical protein ~ 

ORF00632 hypothetical protein 

ORF00633 conserved hypothetical protein 
ORF00635 hypothetical protein 

ORF00636 hypothetical protein "™ — 

ORF00637 hypothetical protein ~~ 

ORF0Q638 conserved hypothetical protein 

ORF00639 conserved domain protein 

ORF00641 prophage LambdaSal , reverse transcriptase/maturase family protein 

ORF00642 conserved hypothetical protein 

ORF00643 conserved hypothetical protein 

ORF00644 hypothetical protein 

ORF00645 hypothetical protein 

ORF00646 conserved hypothetical protein 

ORFQ0647 hypothetical protein 

ORF00649 hypothetical protein 

ORF00650 hypothetical protein 

ORFQ0652 conserved hypothetical protein 

ORF00653 conserved hypothetical protein __ 

ORF00657 conserved hypothetical protein, truncation 
ORF00661 conserved hypothetical protein 
ORF00667 conserved hypothetical protein 

ORF00670 prophage LambdaSal , minor structural protein, putative 

ORF00671 prophage LambdaSal, N-acetylmuramoyl-L-alanine amidase, family 4 

ORF00672 prophage LambdaSal, minor structural protein, putative 

'ORF00673 hypothetical protein 

DRF00674 hypothetical protein — 

DRF00675 conserved hypothetical protein 

ORF00676 conserved hypothetical protein 

ORF00678 conserved hypothetical protein 

ORF0Q681 conserved hypothetical protein 

ORF00682 hypothetical protein 

DRF00683 prophage LambdaSal, site-specific recombinase, phage integrase family FRAMESH1FT 
DRF00685 conserved hypothetical protein 

ORF00689 conserved hypothetical protein, FRAMESHIFT 

ORF00698 hypothetical protein 

ORFQ0703 phosphoserine phosphatase SerB (serB) ~~ 
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ORFQQ7Q4 MutT/nudix family protein 

QRFQ0712 hypothetical protein 

ORF0Q718 cell wall surface protein, interruption-N 

QRF00723 hypothetical protein 

ORF00726 transcriptional regulator, AraC family 

QRF00727 expressed cell wall surface anchor family protein 
QRF00728 expressed cell wall surface anchor family protein 

ORF00735 expressed protein of unknown function 

ORFQ0737 conserved hypothetical protein, degenerate 

ORF00738 hypothetical protein 

ORF00740 hypothetical protein 

ORF00741 hypothetical protein 

ORF00742 lipoprotein, putative 

ORF00747 cylD protein (cylD) 

ORF00749 acyl carrier protein AcpC 

ORF00750 cylZ protein FRAMESHIFT 

ORF00752 cylB protein (cylB) 

ORF00753 cylE protein (cylE) 

ORF0Q754 cylF protein (cylF) 

QRFQ0756 cylJ protein (cylJ) 

ORF00757 cylK protein (cylK) 

ORF00758 hypothetical protein 

QRF00759 putative secreted protein 

ORF00761 hypothetical protein 

QRF00766 expressed putative secreted protein 

ORF0Q767 hypothetical protein 

QRF00768 conserved domain protein 

QRF0Q769 permease, putative 

ORF00775 conserved hypothetical protein 

ORF0Q777 DedA family protein, putative 

ORF00779 membrane protein, putative 

ORF00788 sodium :galactoside symporter family protein, putative 

ORF0Q791 transcriptional regulator, GntR family 

ORF00793 Glucuronate isomerase (uxaC) 

ORF00794 mannonate dehydratase (uxuA) 

ORF00795 D-mannonate oxidoreductase 

ORF00796 hydrolase, haloacid dehalogenase-iike family 

ORF00797 glycosyl hydrolase, family 3 

ORF00806 conserved hypothetical protein 

QRF00822 ABC transporter, ATP-binding protein 

ORF00827 hypothetical protein 

ORF00834 conserved hypothetical protein 

QRF00838 membrane protein, putative 

ORFQ0839 Mn2+/Fe2+ transporter, NRAMP family 

ORF00848 conserved domain protein 

ORF00872 cell wall surface anchor family protein 

ORF00874 conserved hypothetical protein 

QRF00878 ABC transporter, permease protein 

ORF00879 YaeC family protein, putative 

QRF00888 hydrolase, haloacid dehalogenase-like family 

ORF00891 conserved domain protein 

ORF0Q898 conserved hypothetical protein 

ORF00900 permease, GntP family 

ORF00903 transcriptional regulator, MarR family 

ORF00907 glutathione S-transferase family protein 
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ORF00909 hypothetical protein 

ORFQ0921 membrane protein, putative 

QRFQ0922 glycosyl transferase, family 8 

ORF00923 hypotheticai protein 

ORF00924 conserved hypothetical protein 

QRFQ0939 conserved hypothetical protein 

ORF00942 expressed putative secreted protein 

ORF00943 hypotheticai protein 

ORF0Q944 hypothetical protein 

ORF00946 conserved hypothetical protein 

ORF0Q950 hypothetical protein 

QRF00951 transcriptional regulator, TenA family 

ORF00972 ATP synthase F0, C subunit (atpE) 

ORF00980 conserved hypothetical protein 

QRFQ0982 conserved hypothetical protein 

ORF01003 conserved hypothetical protein 

QRF01004 conserved hypothetical protein 

ORF01013 hypothetical protein 

ORF01014 hypothetical protein 

ORF01015 hypothetical protein 

ORF01016 hypothetical protein 

ORF01018 hypothetical protein 

ORF01019 hypothetical protein 

ORF01021 hypothetical protein 

ORF01Q25 HP domain protein 

ORF01Q26 acetyltransferase, GNAT family 

ORF01032 chloramphenicol acetyltransferase (cat) 

ORF01034 Tn916, transposase 

QRF01035 Tn916 t excisionase 

ORF01037 Tn916 t hypothetical protein 

QRF01038 Tn916, hypothetical protein 

ORFQ1039 Tn916, transcriptional regulator, putative 

ORF01041 Tn916, hypothetical protein 

QRF01Q42 Tn916, NLP/P60 family protein 

ORF01044 membrane protein, putative FRAMESHIFT 

ORF01048 Tn916, hypothetical protein 

ORF01049 Tn916, hypothetical protein 

ORF01050 Tn916, hypothetical protein 

ORF01Q51 Tn916, transcriptional regulator, putative 

ORF01052 Tn916, FtsK/SpolllE family protein 

QRF01053 Tn916, hypothetical protein 

QRF01054 Tn916 t hypothetical protein 

ORF01062 hypothetical protein 

QRF01086 Na+/H+ exchanger family protein 

QRF01092 acetyltransferase, GNAT family 

ORF01096 nisin-resistance protein, putative 

ORF01103 conserved hypothetical protein 

ORF01 124 acetyltransferase, GNAT family 

ORF01133 iron-compound ABC transporter, iron-compound-binding protein 

ORF01140 conserved hypothetical protein 

ORF01142 carbon starvation protein CstA, putative 

QRF01 143 response regulator 

ORF01 144 sensor hlstidlne kinase, putative 

ORF01145 lipoprotein, putative 

ORF01146 conserved hypothetical protein, FRAMESHIFT 
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ORF01148 lipoprotein, putative 

ORFQ1149 hypothetical protein 
ORF01150 hypothetical protein 

ORF01151 hypothetical protein 

ORF01152 lipoprotein, putative 

ORF01 1 53 hypothetical protein 

QRF01157 conserved hypothetical protein 

ORF01158 hypothetical protein 

ORF01 1 59 hypothetical protein 

ORF01 160 expressed protein of unknown function FRAMESH1FT 

jORF01161 expressed conserved domain protein 

IQRF01162 conserved hypothetical protein 

ORF01164 FtsK/SpolllE family protein FRAMESHIFT 

ORFQ1166 hypothetical protein 

1ORF01167 conserved hypothetical protein 

ORF01168 conserved hypothetical protein 

ORF01169 hypothetical protein 

ORF01172 phage infection protein, putative 

ORF01173 conserved hypothetical protein 

ORF01 174 conserved domain protein 

ORF01175 hypothetical protein 

ORF01182 membrane protein, putative 

ORF01 186 cell wall surface anchor family protein, putative 

ORF01 187 hypothetical protein 

ORF01204 hypothetical protein 

ORF01215 hypothetical protein 

ORF01241 transcriptional regulator, AraC family, putative 
ORF01253 rarD protein (rarD) 
ORF01257 transporter, BCCT family protein 
ORF01258 hypothetical protein 
ORFQ1261 expressed protein of unknown function 
1ORF01262 conserved hypothetical protein, FRAMESHIFT 
ORF01263 hypothetical protein 
ORF01265 hypothetical protein 
QRF01266 hypothetical protein 
ORF01269 conserved hypothetical protein 
QRF01272 conserved hypothetical protein 
ORF01277 conserved hypothetical protein 
ORF01287 consen/ed hypothetical protein 
QRFQ1288 membrane protein, putative 

ORF01299 CMP-N-acetylneuraminic acid synthetase NeuA (neuA) 
QRF01 300 neuP protein (neuP) ~~ 
ORF01301 UPP-N-acetylglucosamine-2-epimerase NeuC (neuC) 
ORF01302 N-acetyl neuramic acid synthetase NeuB (neuB) 
ORF01303 polysaccharide biosynthesis protein CpsL (cpsL) 
ORF01304 polysaccharide biosynthesis protein CpsK(V) (cpsK) 
ORF01307 glycosyltransferase CpsN(V) (cpsN) 
ORF01308 polysaccharide biosynthesis protein CpsM(V) (cpsM) 
ORF01309 polysaccharide biosynthesis protein cpsH(V) (cpsH) ~~ 
ORF01 31 0 glycosyltransferase CpsG(V) (cpsG) 
QRF01311 polysaccharide biosynthesis protein CpsF (cpsF) 
ORF01312 glycosyltransferase CpsE (cpsE) 
[ORF01348 conserved domain protein 

QRF01 349 hypothetical protein ~ 
|ORF01370 conserved hypothetical protein 



512 



WO 2004/018646 



PCT/US2003/026827 



Table 1 1 : GBS genes not shared with GAS or pneumococcus 



ORFxxxxx Annotation 

ORF01371 conserved hypothetical protein 

ORF01372 expressed protein of unknown function 

ORF01373 ISSdyl, transposase OrfA 

ORF01375 conserved hypothetical protein 

ORF01379 transposase OrfB, IS3 family, truncation 

ORF01382 GBSil, group II intron, maturase 

QRF01384 hypothetical protein 

ORF01385 hypothetical protein 

QRF01386 conserved hypothetical protein 

QRF01387 conserved hypothetical protein, truncation 

ORF01390 ISSdyl, transposase OrfA FRAMESHIFT 

ORF01392 hypothetical protein 

ORF01393 hypothetical protein 

ORF01394 site-specific recombinase, phage integrase family 

QRF01395 conserved hypothetical protein 

QRF014Q1 transposase, 1SL3 family 

ORF01404 mercuric resistance operon regulatory protein MerR (merR) 

QRFQ1408 cadmium efflux system accessory protein (CadC) 

ORF01409 conserved hypothetical protein 

ORF01410 hypothetical protein 

ORF01417 hypothetical protein 

ORF01418 hypothetical protein 

ORF01420 hypothetical protein 

ORF01421 ImpB/MucB/SamB family protein 

ORF01423 conserved hypothetical protein 

ORF01424 conserved hypothetical protein 

ORF01425 conserved hypothetical protein 

ORF01426 conserved hypothetical protein 

QRF01427 hypothetical protein 

ORF01428 conserved hypothetical protein 

ORF01430 hypothetical protein 

ORF01431 hypothetical protein 

ORF01432 conserved domain protein 

ORF01433 SNF2 family protein 

ORF01434 hypothetical protein 

ORF01435 calcium-binding protein, putative 

ORF01436 agglutinin receptor (ssp-5) 

ORF01437 abortive infection protein AbiGl (abiGI) 

ORFQ1438 abortive infection protein AbiGIl (abiGIl) 

ORF01439 conserved hypothetical protein 

ORF01440 expressed protein of unknown function 

ORFQ1441 conserved hypothetical protein, degenerate 

ORF01442 membrane protein, putative 

ORF01443 hypothetical protein 

ORF01444 Tn5252, Orf 21 protein, internal deletion 

ORF01445 hypothetical protein 

ORF01450 conserved hypothetical protein 

ORFQ1452 hypothetical protein 

ORF01454 conserved hypothetical protein 

ORF01459 hypothetical protein 

ORFQ1460 homocysteine S-methyltransferase MmuM, putative 

ORF01463 hypothetical protein 

ORF01464 hypothetical protein 

QRF01465 hypothetical protein 

ORF01466 transcriptional regulator, TetR family 
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ORF01477 glutathione S-transferase family protein, putative 
ORFQ1478 conserved domain protein 
ORF01486 hypothetical protein 

ORF01488 R5 protein 

QRF01489 transcriptional regulator, MarR family, putative 

ORF01494 membrane protein, putative 

ORF01497 acetyltransferase, GNAT family 

ORF01502 hypothetical protein 

ORF01503 conserved hypothetical protein 
QRF01508 surface antigen-related protein 
QRF01535 conserved hypothetical protein 

QRF01547 conserved hypothetical protein — — 

QRF01 566 expressed cell wall surface anchor family protein 
ORF01572 glycosyltransferase, group 1 family protein 
QRF01573 glycosyltransferase, group 2 family protein 

ORF01575 membrane protein, putative 

ORF01576 glycosyltransferase, group 2 family protein 
ORF01 577 glycosyltransferase, group 2 family protein 
ORFQ1578 nucleotide sugar dehydratase, putative 

ORF01581 lipoprotein, putative — 

ORF01582 conserved hypothetical protein 

ORFQ1596 ammonium transporter family protein 

ORFQ1597 conserved hypothetical protein 

ORF01601 hypothetical protein 

ORF01608 proton/peptide symporter family protein 

ORF01611 hypothetical protein 

ORF01615 conserved domain protein 

ORF01638 conserved hypothetical protein 

ORF01641 conserved hypothetical protein 

QRF01645 cell wall surface anchor family protein 

QRF01660 membrane protein, putative 
ORF01661 ABC transporter, ATP binding protein 

QRF01666 hypothetical protein 

ORF01667 hypothetical protein 

ORF01670 hypothetical protein 

ORF01672 protease, putative, POINT MUTATION 

ORF01673 hypothetical protein 

ORF01674 hypothetical protein 

ORF01675 hypothetical protein 

ORF01680 tetracenomycin polyketide synthesis O-methyltransferase TcmP, putative 

ORFQ1681 hypothetical protein 

ORF01682 hypothetical protein 

ORF01684 hypothetical protein 

ORF01692 peptide ABC transporter, ATP-binding protein 
ORF01 695 peptide ABC transporter, permease protein 
ORF01696 peptide ABC transporter, peptide-binding protein 
ORF01699 transposase, IS30 family, putative 
ORF01 700 transporter, major facilitator family 

ORF017Q3 transcriptional regulator, LysR family 

ORF01 71 5 conserved hypothetical protein ~~ 
ORF01 71 9 hypothetical protein 

ORF01720 conserved hypothetical protein 

ORF01721 glyoxalase family protein 

ORF01727 conserved hypothetical protein 

ORF01729 acetyltransferase, GNAT family 
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ORF01730 glycosyl transferase, group 2 family protein 

ORF01733 hypothetical protein 

ORF01734 conserved hypothetical protein 
ORF01 735 hypothetical protein 
QRFQ1736 hypothetical protein 
ORF01737 hypothetical protein 
ORF01742 hypothetical protein 
ORF01 743 PTS system component, putative 

ORF01 744 conserved hypothetical protein 

ORF01 748 D-isomer specific 2-hydroxyacid dehydrogenase family protein 
ORFQ1753 conserved hypothetical protein 
ORF01 754 hypothetical protein 

ORF01761 transposase, IS30 family, putative, truncation 

ORF01778 amino acid permease, putative 

QRF01807 hypothetical protein 

QRF01836 hypothetical protein 

ORF01838 hypothetical protein 

ORF01839 dihydroxyacetone kinase family protein 

ORF01840 transcriptional regulator, TetR family, putative 

ORF01842 hypothetical protein 

ORF01843 dihydroxyacetone kinase family protein 
ORF01844 dihydroxyacetone kinase family protein 
ORF01847 conserved hypothetical protein 
ORF01850 hypothetical protein 

ORF01863 pyruvate phosphate dikinase (ppdK) 

QRF01864 expressed protein of unknown function 

ORF01865 CBS domain protein 

ORF01866 3-hydroxyacyl-CoA dehydrogenase family protein, putative secreted protein 
ORF01892 hypothetical protein 

ORF01893 hypothetical protein 

ORF01894 conserved hypothetical protein 

ORF01895 hypothetical protein 

ORF01896 hypothetical protein 

QRFQ1897 hypothetical protein 

ORF01 898 hypothetical protein 

ORF01899 hypothetical protein 

ORF01903 conserved hypothetical protein 

ORF01904 drug resistance transporter, EmrB/QacA family 

ORF01905 hypothetical protein 

QRF01922 conserved hypothetical protein 

ORF01925 FMN-binding protein 

QRF01934 hypothetical protein 

ORF01936 polyprenyl synthetase family protein 

ORF01939 cytochrome d ubiquinol oxidase, subunit ll (cydB) 

ORF01940 cytochrome d oxidase, subunit I (cydA) 

ORF01941 pyridine nucleotide-disulphide oxidoreductase family protein 

ORF01942 prenyltransferase, UbiA family 

ORF01943 hypothetical protein 

ORF01944 hypothetical protein 

ORF01946 cyclopropane-fatty-acyl-phospholipid synthase (cfa) 

ORF01951 conserved hypothetical protein 

ORF01953 hypothetical protein 

ORF01 954 conserved hypothetical protein 

ORF01984 hypothetical protein 

QRF01988 hypothetical protein ^^^^ 
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ORF01989 hypothetical protein 
ORF01990 hypothetical protein 
ORFQ1991 hypothetical protein 
ORF02000 membrane protein, putative 
ORF020Q1 transposase, )S30 family, putative 
ORF02005 hypothetical protein 

QRF02QQ6 xylulose-5-phosphate/fructose-6-phosphate phosphoketolase (xfp) 

ORF02009 conserved hypothetical protein 

ORF0201Q carbohydrate kinase, FGGY family 

ORF02Q1 1 hypothetical protein 

ORF02012 PTS system component, putative 

QRFQ201 5 glyoxylate reductase, NADH-dependent 

ORFQ201 6 hypothetical protein 

QRF02025 hypothetical protein 

QRF02026 hypothetical protein 

ORFQ2030 glutamate-cysteine ligase-related protein 

ORF02036 phosphinothricin N-acetyltransferase (pat) 

QRF02039 conserved hypothetical protein 

ORF02044 conserved hypothetical protein 

ORF02045 conserved hypothetical protein — 

ORF02046 prophage LambdaSa2, lysin, putative 

ORF02047 prophage LambdaSa2, holin, putative 

QRF02048 conserved hypothetical protein 

ORF02049 hypothetical protein 

QRF0205Q conserved domain protein 

ORF02051 prophage LambdaSa2, PblB, putative 

ORF02Q53 conserved hypothetical protein 

ORF02056 conserved hypothetical protein 

ORF02057 hypothetical protein 

ORF02058 hypothetical protein 

ORF02059 conserved hypothetical protein 

ORF02060 conserved hypothetical protein "" — — — 

ORF02061 hypothetical protein 

ORF02062 hypothetical protein 
ORF02063 conserved domain protein 

ORF02064 conserved domain protein " 

ORF02066 prophage LambdaSa2, protease, putative 

ORF02Q67 conserved hypothetical protein ~~ 
ORF02068 prophage LambdaSa2, terminase large subunit, putative 
ORF02069 hypothetical protein 
QRF02Q7Q hypothetical protein 

QRFQ2071 prophage LambdaSa2, site-specific recombinase, phage integrase family 
ORF02072 conserved hypothetical protein ~ — 

ORF02073 prophage LambdaSa2, transcriptional regulator, Cro/CI family 
ORF02075 hypothetical protein 
ORF02Q77 hypothetical protein 
ORF02078 conserved hypothetical protein 

ORF02079 conserved hypothetical protein 

ORF02080 conserved hypothetical protein 
ORF02081 hypothetical protein 

ORF02084 prophage LambdaSa2, bacteriophage replication protein/hypothetical protein, 

truncation/fusion 

ORF02085 hypothetical protein 

ORF02087 hypothetical protein 

DRF02088 conserved hypothetical protein 
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ORF02Q89 prophage LambdaSa2, HNH endonuciease family protein 
ORFQ2090 prophage LambdaSa2, antirepressor protein, putative 

ORF02091 conserved domain protein 

ORF02Q92 hypothetical protein 
QRF02093 hypothetical protein 

ORF02094 hypothetical protein 

QRFQ2095 prophage LambdaSa2 t repressor protein, putative 

ORF02097 hypothetical protein 

ORF02098 prophage LambdaSa2, site-specific recombinase, phage integrase family 

QRF02100 hypothetical protein 

ORFQ2102 hypothetical protein 

ORF02103 microcin immunity protein MccF, putative 

ORF02105 oxidoreductase, Gfo/ldh/MocA family 

|ORF02108 hypothetical protein 

ORF02109 Cyclic nucleotide-binding domain protein 
ORF02119 hypothetical protein 
ORF02124 hypothetical protein 
ORF02125 nitroreductase family protein 
ORF021 34 bacteriocin transport accessory protein, putative 
ORFQ2148 neuraminidase-related protein 
ORF021 60 2\3^cyclic-nucleotide 2^phosphodiesterase (cpdB) 
ORF02163 conserved hypothetical protein 
IQRFQ2171 membrane protein, putative 
ORF02172 hypothetical protein 
ORF02173 membrane protein, putative 
|ORF02175 conserved hypothetical protein, truncation 
ORF02181 phosphate transport system regulatory protein PhoU, putative 
ORF02187 hypothetical protein 
ORF02190 conserved hypothetical protein 
ORF02191 hypothetical protein 
ORF02194 acetyltransferase, GNAT family 
ORF02196 hypothetical protein 
ORF02198 acetyltransferase, GNAT family 
ORF02201 membrane protein, putative 
ORF02203 hypothetical protein 
ORF02205 transcriptional regulator, Cro/Cl family 
ORF02206 conserved hypothetical protein 
ORF02207 conserved hypothetical protein TIGR00730 
ORFQ2208 hypothetical protein 

ORF02209 site-specific recombinase, phage integrase family 
[ORF02210 conserved hypothetical protein 
IORF02211 conserved hypothetical protein 
ORF02212 hypothetical protein 
ORF02213 hypothetical protein 
QRF02214 transcriptional regulator, Cro/Cl family 
|QRF02215 expressed protein of unknown function 
|ORF02216 site-specific recombinase, phage integrase family 
|ORF02217 conserved hypothetical protein 
ORF02219 hypothetical protein 
ORFQ2221 cell wall anchor protein-related protein 
ORF02223 hypothetical protein 
QRF02224 hypothetical protein 

QRFQ2225 hypothetical protein ■ 
ORF02226 membrane protein, putative 
|QRF02227 conjugal transfer protein, interruption-C 
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ORFQ2230 conserved hypothetical protein 

ORF02231 conserved hypothetical protein 

ORF02232 conserved hypothetical protein 

ORF02235 hypothetical protein 

ORF02236 conserved hypothetical protein 

ORF02237 hypothetical protein 

QRF02238 hypothetical protein 

ORF02239 hypothetical protein 

ORF02240 transcriptional regulator, Cro/CI family 

ORF02241 hypothetical protein 

ORF02242 transcriptional regulator, Cro/CI family 

ORF02243 FtsK/SpolllE family protein 

ORF02244 hypothetical protein 

ORF02245 hypothetical protein 

ORF02246 cell wall surface anchor family protein 

ORF02247 transposase, 1SL3 family 

ORF02250 mercuric resistance operon regulatory, protein MerR (merR) 

ORF02251 Mn2+/Fe2+ transporter, NRAMP family 

ORF02252 membrane protein, putative 

ORF02253 ABC transporter, ATP-binding protein 

ORF02254 conserved hypothetical protein 

ORFQ2255 streptomycin resistance protein 

ORF02257 hypothetical protein 

ORFQ2258 hypothetical protein 

ORF02259 conserved hypothetical protein . 

ORF02260 acetyltransferase, GNAT family 

ORFQ2261 membrane protein, putative 

QRF02263 hypothetical protein 

ORF02264 transcriptional regulator, Cro/CI family m 

ORF02265 PAP2 family protein 

ORF02266 conserved hypothetical protein FRAMESHIFT 

ORF02267 conserved hypothetical protein T1GR0Q73Q 

ORFQ2268 protease, putative 

QRF02269 rhodanese family protein 

ORF02271 hypothetical protein 

ORFQ2274 conserved hypothetical protein 

ORF02275 5-methyltetrahydrofolate-homocysteine methyltransf erase, putative 

ORF02277 conserved hypothetical protein 

ORF02279 hypothetical protein 

ORF02282 sensor histidine kinase 

ORF02283 chromosome assembly-related protein 

ORF02287 expressed protein of unknown function 

ORF02291 pathogenicity protein, putative 

ORF02308 hydrolase, haloacid dehalogenase-like family 

ORF02314 conserved hypothetical protein 

ORF02317 hypothetical protein 

ORF02330 hypothetical protein 

ORF02344 site-specific recombinase, phage integrase family 

ORF02345 conserved hypothetical protein 

ORF02346 conserved hypothetical protein 

ORF02347 hypothetical protein 

ORF02349 conserved hypothetical protein 

QRF0235Q hypothetical protein 

ORF02351 transcriptional regulator, Cro/CI family 

ORF02352 conserved domain protein 
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ORFQ2354 hypothetical protein 

ORF02356 expressed putative secreted protein 

QRF02362 sensor histidine kinase 

ORF02363 response regulator 

QRF02367 membrane protein, putative 

QRF02368 conserved hypothetical protein 

ORF02379 membrane protein, putative 

ORFQ2395 transcriptional regulator, Cro/Cl family 

ORFQ2406 membrane protein, putative 

ORF02416 diacylglycerol kinase catalytic domain protein, putative 

QRF02418 hypothetical protein 

ORF02422 hypothetical protein 

ORF02425 conserved hypothetical protein 

ORF03Q01 conserved hypothetical protein 

ORF03Q04 conserved hypothetical protein 

ORF03005 cylX protein 

ORFQ30Q6 Tn916, hypothetical protein 

ORFQ3007 Tn916, hypothetical protein 

ORF03008 Tn916, hypothetical protein 
QRF03009 Tn916, tetM leader peptide 
ORF03010 Tn916, hypothetical protein 

ORF03012 prophage LambdaSa2, HNH endonuclease family protein 
ORF0301 3 conserved hypothetical protein 
ORF03015 conjugal transfer protein, interruption-N 
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ORF00035 membrane protein, putative ~ 
ORF00087 lipoprotein, putative 
ORF00088 hypothetical protein 

QRF00089 hypothetical protein ~~~~~ 

ORF00123 hypothetical protein 

ORF00138 hypothetical protein ~" 
ORF00187 hypothetical protein 

ORF00188 hypothetical protein 

ORF00192 hypothetical protein 
QRF0Q205 hypothetical protein 

ORF00228 lipoprotein, putative " 
QRF00234 hypothetical protein ~ 
ORF00235 hypothetical protein 

ORFQ0238 hypothetical protein 

ORF0024Q transcriptional regulator, Cro/Ci family 
ORFQQ241 hypothetical protein 
ORF00242 conserved hypothetical protein 
ORF00243 hypothetical protein 
ORF00247 hypothetical protein 
ORF00249 hypothetical protein 

ORFQ0253 hypothetical protein 

ORFQ0254 hypothetical protein 
ORF00255 hypothetical protein 

QRFQQ256 hypothetical protein 

ORF00257 hypothetical protein 

QRF00258 hypothetical protein 
ORF00259 hypothetical protein 
ORF00260 hypothetical protein 

QRF00272 expressed putative lipoprotein __ 

ORFQ0273 hypothetical protein 
ORF00274 hypothetical protein 
QRF00275 hypothetical protein 

ORF00276 hypothetical protein 

ORF00278 membrane protein, putative 

ORF00285 lipoprotein, putative 

ORF00292 hypothetical protein 

ORF00294 expressed protein of unknown function 

ORF0Q308 conserved hypothetical protein 

ORF00332 hypothetical protein 

ORF00340 hypothetical protein 

ORFQ0384 hypothetical protein 

ORF0Q402 membrane protein, putative 

ORFQ0408 hypothetical protein 

ORFQ0416 hypothetical protein 

ORF00417 hypothetical protein 

ORF00448 hypothetical protein 

ORFQ0476 hypothetical protein 

ORF00489 DNA-damage-inducible protein J, putative 
ORF00490 hypothetical protein 

ORF0Q491 lipoprotein, putative 

ORF00497 conserved domain protein 

ORF00510 bacteriocin transport accessory protein, putative 
ORFQ0512 hypothetical protein 

ORF00527 hypothetical protein 

ORF00556 hypothetical protein 
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ORF00575 hypothetical protein 

QRFQQ599 hypothetical protein 

ORFQ0618 hypothetical protein ' 

ORF00620 hypothetical protein 

ORF00623 hypothetical protein ~~ 
QRFQ0626 prophage LambdaSal , transcriptional regulator, Cro/CI family 
QRF00628 hypothetical protein 

QRF00630 hypothetical protein 

ORF00632 hypothetical protein 

ORFQ0635 hypothetical protein ~~ 
QRFQ0636 hypothetical protein 
ORF00637 hypothetical protein 
QRFQ0642 conserved hypothetical protein 

QRF00644 hypothetical protein 

ORF00645 hypothetical protein 

QRFQ0647 hypothetical protein 

QRF0Q649 hypothetical protein 

ORFQ0650 hypothetical protein 
ORF00653 conserved hypothetical protein 

ORF0Q657 conserved hypothetical protein, truncation 

QRFQ0661 conserved hypothetical protein 

QRF00673 hypothetical protein 

ORF00674 hypothetical protein 

QRF00675 conserved hypothetical protein 

ORF00676 conserved hypothetical protein 

QRF00682 hypothetical protein 

ORF00685 conserved hypothetical protein 

ORF00698 hypothetical protein 

ORF0071 2 hypothetical protein . 

ORF00718 cell wall surface protein, interruption-N 

ORF00723 hypothetical protein 

QRF00735 expressed protein of unknown function 

ORF00737 conserved hypothetical protein, degenerate 

QRF00738 hypothetical protein 

QRF00740 hypothetical protein 

ORF00741 hypothetical protein 

ORF00747 cylD protein (cylD) 

ORF00753 cylE protein (cylE) 

ORF00756 cylJ protein (cylJ) 

QRF0Q757 cylK protein (cylK) 

ORFQ0758 hypothetical protein 

ORF00759 putative secreted protein 

ORF00761 hypothetical protein 

ORF00796 hydrolase, haloacid dehalogenase-iike family 

ORF00806 conserved hypothetical protein — — 
ORF00822 ABC transporter, ATP-binding protein 
ORF00827 hypothetical protein 

QRF00872 cell wall surface anchor family protein 

ORF009Q9 hypothetical protein 
ORF00923 hypothetical protein 
ORF00924 conserved hypothetical protein 
ORF00942 expressed putative secreted protein 
ORF00943 hypothetical protein 
ORF00944 hypothetical protein 

ORF01013 hypothetical protein ~~ ~ ~~ " 
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ORFQ1014 hypothetical protein — 

QRFQ1 01 5 hypothetical protein 

ORF01016 hypothetical protein 

ORFQ1Q18 hypothetical protein 

QRF01 01 9 hypothetical protein 

ORFQ1021 hypothetical protein 

ORF01 035 Tn91 6, excisionase — 
ORF01062 hypothetical protein — 
QRF01096 nisin-resistance protein, putative 
ORF01145 lipoprotein, putative 

ORF01 146 conserved hypothetical protein, FRAMESHIFT 
ORFQ1148 lipoprotein, putative 

ORF01 149 hypothetical protein ■ ~ 

ORF01150 hypothetical protein 

ORFQ1 1 51 hypothetical protein 

ORF01 1 52 lipoprotein, putative 

ORF01 1 53 hypothetical protein 

ORF01158 hypothetical protein 

ORF01 1 59 hypothetical protein 

ORF01 161 expressed conserved domain protein 
QRF01 1 62 conserved hypothetical protein 

ORF01 1 66 hypothetical protein 

QRF01168 conserved hypothetical protein 

ORF01 1 69 hypothetical protein 

ORF01174 conserved domain protein ~~ 

ORF01 1 75 hypothetical protein 

ORF01 186 cell wall surface anchor family protein, putative 

ORF01 1 87 hypothetical protein 

ORF01 204 hypothetical protein j 

ORF01 21 5 hypothetical protein 

ORF01258 hypothetical protein 

ORF01262 conserved hypothetical protein, FRAMESHIFT 

ORF01263 hypothetical protein 

QRF01265 hypothetical protein 

ORF01266 hypothetical protein 

ORF01304 polysaccharide biosynthesis protein CpsK(V) (cpsK) 

ORF01308 polysaccharide biosynthesis protein CpsM(V) (cpsM) 
ORF01309 polysaccharide biosynthesis protein cpsH(V) (cpsH) 

QRF01 349 hypothetical protein 

ORF01384 hypothetical protein 

ORF01 385 hypothetical protein 

ORF01 386 conserved hypothetical protein " 

ORF01 392 hypothetical protein ~~ 

QRFQ1395 conserved hypothetical protein 

QRF01409 conserved hypothetical protein 

ORF01410 hypothetical protein 

ORF01417 hypothetical protein 

ORF01418 hypothetical protein " — 

ORF01420 hypothetical protein 

ORFQ1423 conserved hypothetical protein 

ORF01424 conserved hypothetical protein ™ 
QRF01425 conserved hypothetical protein 
QRFQ1426 conserved hypothetical protein 

ORF01 427 hypothetical protein ~ 

IORF01431 hypothetical protein — 
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ORF01432 conserved domain protein 



ORF01434 hypothetical protein 



QRF01435 calcium-binding protein, putative 



ORF01437 abortive infection protein AbiGI (abiGI) 



ORF01438 abortive infection protein AbiGII (abiGll) 



ORF01441 conserved hypothetical protein, degenerate 



ORFQ1443 hypothetical protein 



ORF01445 hypothetical protein 



ORF01452 hypothetical protein 



ORF01459 hypothetical protein 



ORF01463 hypothetical protein 



ORF01464 hypothetical protein 



ORF01465 hypothetical protein 



ORF01486 hypothetical protein 



ORF01488 R5 protein 



QRFQ1575 membrane protein, putative 



ORF01581 lipoprotein, putative 



ORF016Q1 hypothetical protein 



ORF01611 hypothetical protein 



ORF01638 conserved hypothetical protein 



QRF01645 cell wall surface anchor family protein 



ORF01660 membrane protein, putative 



QRF01666 hypothetica 



protein 



ORF01667 hypothetica 



protein 



QRF01670 hypothetica 



protein 



ORF01673 hypothetica 



protein 



ORF01674 hypothetica 



protein 



ORF01675 hypothetica 



protein 



ORF01681 hypothetica 



protein 



ORF01682 hypothetica 



protein 



ORF01684 hypothetica 



protein 



ORF01719 hypothetica 



protein 



ORF01733 hypothetica 



protein 



ORFQ1735 hypothetica 



protein 



ORF01736 hypothetica 



protein 



ORF01737 hypothetica 



protein 



ORF01742 hypothetica 



protein 



ORF01754 hypothetica 



protein 



ORF01761 transposase, IS30 family, putative, truncation 



ORF01807 hypothetica 



protein 



ORF01836 hypothetica 



protein 



ORF01838 hypothetica 



protein 



ORF01842 hypothetica 



protein 



ORF01850 hypothetica 



protein 



ORF01892 hypothetica 



protein 



ORF01893 hypothetica 



protein 



ORF01895 hypothetica 



protein 



ORF01896 hypothetica 



protein 



ORF01897 hypothetica 



protein 



QRFQ1898 hypothetica 



protein 



ORF01899 hypothetica 



protein 



QRF01905 hypothetica 



protein 



ORFQ1934 hypothetica 



protein 



ORF01943 hypothetica 



protein 



ORF01944 hypothetica 



protein 
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ORF01953 hypothetical protein 

QRFQ1984 hypothetical protein 

QRF01988 hypothetical protein 

QRF01989 hypothetical protein 

QRF02005 hypothetical protein 

ORF0201 1 hypothetical protein 

ORF02016 hypothetical protein 

ORF02025 hypothetical protein 

QRF02026 hypothetical protein 

ORF02045 conserved hypothetical protein 

ORF02047 prophage LambdaSa2, hoiin, putative 

ORF02048 conserved hypothetical protein 

ORF02049 hypothetical protein 

QRF02050 conserved domain protein 

ORF02053 conserved hypothetical protein 

ORF02Q57 hypothetical protein 

ORFQ2058 hypothetical protein 

ORF02061 hypothetical protein 

ORFQ2062 hypothetical protein 

QRF02063 conserved domain protein 

ORF02067 conserved hypothetical protein 

ORFQ2069 hypothetical protein 

ORF0207Q hypothetical protein 

QRF02072 conserved hypothetical protein 

ORF02Q73 prophage LambdaSa2, transcriptional regulator, Cro/CI family 

ORF02075 hypothetical protein 

QRF02077 hypothetical protein 

ORF02078 conserved hypothetical protein 

QRF02081 hypothetical protein 

QRF02085 hypothetical protein 

QRF02087 hypothetical protein 

QRF02088 conserved hypothetical protein 

ORF02091 conserved domain protein 

ORFQ2092 hypothetical protein 

QRF02093 hypothetical protein 

ORF02094 hypothetical protein 

ORF02097 hypothetical protein 

QRFQ210Q hypothetical protein 

ORF02102 hypothetical protein 

ORF02108 hypothetical protein 

ORF02119 hypothetical protein 

ORFQ2124 hypothetical protein 

QRF02171 membrane protein, putative 

ORF02172 hypothetical protein 

ORF02173 membrane protein, putative 

ORF02191 hypothetical protein 

ORFQ2196 hypothetical protein 

ORF02203 hypothetical protein 

QRF022Q8 hypothetical protein 

ORF02212 hypothetical protein 

ORF02213 hypothetical protein 

QRF02214 transcriptional regulator, Cro/CI family 

ORF02215 expressed protein of unknown function 

ORF02217 conserved hypothetical protein 

QRF02219 hypothetical protein 
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QRF02221 cell wall anchor protein-related protein 

ORF02223 hypothetical protein 

ORF02224 hypothetical protein 

ORFQ2225 hypothetical protein 

ORF02231 conserved hypothetical protein 

ORF02235 hypothetical protein 

ORFQ2236 conserved hypothetical protein ~~ 
QRF02237 hypothetical protein 

ORF02238 hypothetical protein 

ORF02239 hypothetical protein 

ORF02241 hypothetical protein 

ORF02244 hypothetical protein ~~ 

ORF02245 hypothetical protein 

QRF02263 hypothetical protein 

ORF02268 protease, putative 

ORF02271 hypothetical protein 

ORF02279 hypothetical protein 

ORF02283 chromosome assembly-related protein 

ORF02317 hypothetical protein 

ORF02330 hypothetical protein 

ORFQ2344 site-specific recombinase, phage integrase family 

ORF02345 conserved hypothetical protein 

ORFQ2347 hypothetical protein 

ORF02349 conserved hypothetical protein ~~ 
ORF02350 hypothetical protein ~~ 
ORFQ2351 transcriptional regulator, Cro/Cl family _ 

ORF02354 hypothetical protein 

ORF02356 expressed putative secreted protein 

ORF02395 transcriptional regulator, Cro/Cl family 
ORF02418 hypothetical protein 
ORFQ2422 hypothetical protein 
ORF02425 conserved hypothetical protein 
ORF03004 conserved hypothetical protein 

QRF03005 cylX protein 

ORF03006 Tn916 t hypothetical protein 

ORF03007 Tn916, hypothetical protein 

ORF030Q8 Tn916, hypothetical protein 

ORF03009 Tn916, tetM leader peptide \ 

QRF03010 Tn916, hypothetical protein 

ORF0301 5 conjugal transfer protein, interruption-N 
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Table 13; Comparative Sequences relating to SAG0466 (thiolase) 



SEQ ID NO. 1301: SAG0466 FROM THE 2603V/R GBS STRAIN 

CTCCTGCCCCTGCAATGGCAGTTAGACCCATAGGTTTATTTTTATATTTTAATGCCTGCATAAGATGAAGGATATTAATA 
ATTCCTGAGCAGGCATAAGGGTGTCCGTAAGCTAATGTCCCTCCAAAAATATTGAATTTTTCTCTCTCTTCAGGATAATA 
ATGATTAAATAGAGCATCAATCGCTGCAAATGGTTCATTCCATTCAATTGCATCATAATCCGATATTTTAGTATGAGTTT 
CTGTTAATAGTTTTTCGGTAGCCGTGTGAACCAATTCTGGACTAAGCTTGGGATCTCCTGCTACTTCTACAATGTGAACA 
ATCCGGAATTCTGTTTTCTGACTCTGAAGCGTTAGAAATGCAGCAGCATCGTGCATTAAACAAACATTTCCAATAGTGAG 
CAAAGGTGAATTTTCCATCAATCTTGGTAATTTTTGAAAAAATGTTtCTTTTaGTTTTCTAACGCCTTGATCTCGCATCC 
CTTCCATTGGTAAGATTACyTCTTCTAAATAGCCACCTTGTTTAGCTGTTAAGGCGCGTTTATGGCTCAAGAATGCCAAT 
TTATCTAACATTTCTCTTCTAAAaCCATATTTTTGACAGACTCTCTGGGCCCCTTCTAACATTACAGTTTCAGCATAAGA 
GTCAGGAGAAAACTGAGCAACTGTATATTCTCCGTTACGATTATCTTCTTTAGCATAACGTCTCATAGGTTGAAGAGAAC 
TACTTTCAATCCCCCCAACAAGAACTTTTTCATTAATACCGGTACTGATTTTTAGATAACCAAAAAACAAGGCAGAACTT 
GATGAAGCACACTGCATATCAATCGTTTGTACTGGAATATAGGATTCATAATCAGAAAAAAGAGTCATCAAACGACCAAT 
ATTGCCCCCAGTACCAACTGTGTTCCCACAAATAATACTATCAATGTTAGATTCTGATTCTATTTTTTTTATTTGATTTA 
AAAGGTGTGCTCCTAAAAGTTCTGGACGGTAAGTTTAAATTGCTT 

SEQ ID NO- 1302: SAG0466 FROM THE M732 GBS TYPE III STRAIN 

T CGGT AT AAAAGGGAAGC AATTTAAACATTACCGT CCAGAACTTTTAGGAGCAC AC CTC TTAAATCAAATAAAAAAAATA 
GAATCAGAATCTAATATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT 
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GTTATCTAAAAATCAGTGCCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGA 
CGTTACGCTAAAGAAGATAATCGTAACGGAGAATATACCGTTGCTCAGTTTTCTCCTGACTCTTATGCTGAAACTGTAAT 
GTTAGAAGGGGCACAAAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATA 
AACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTT 
AGAAAACTAAAAGAAGCATTTTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTG 
TTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAG 
C AGGAGAT CCC AAGCTTAGTC CAGAATTGGTTCACACGGCTACGGAAAAAC TATTAACAGAAACTCATACTAAAATATCG 
GATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGA 
AAAATTCAATATTTTTGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTA 

SEQ ID NO. 1303: SAG0466 FROM THE 090 GBS TYPE la STRAIN 

TTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTTTTTCTGATTATGAATCCTATATTCCAG 
TACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTGCCGGTATTAAT 
GAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGG 
AGAATATACCGTTGCTCAGTTTTCTCCTGACTCTTAkGCTGAAACTGTAATGtTAGAAGGGGCACAAAGAGTCTGTCAAA 
AATATGGTTTtAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATAAACGCGCCTTAACAGCTAAACAAGGTGGC 
TATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAA 
ATTACCAAGATTGATGGrAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTwA 
CGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTG 
GTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACC 
ATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAG 
CTTACGGACACCCTTATGCCTGCTCAGG 

SEQ ID NO. 1304: SAG0466 FROM THE COHl GBS TYPE la STRAIN 

ATCGGTATAAAAGGGAAGCAATTTAAAATTAC CGTCCAGAACTTTTAGGAG CACACCTC TTAAATCAAATAAAAAAAATA 
GAATCAGAATCTAATATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT 
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GGTATCTAAAAA 

SEQ ID NO. 1305 : SAG0466 FROM THE CJB GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

TTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTG 
CATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGT 
CCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATG 
GAATGAACCATTTGCAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAG 
GGGCATTAGCTTACGGACACCCTTAATGCCTGCTCAGGAATTATTAATATCC 

SEQ ID NO. 1306: sag0466 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAATAAAAAAAATATA 
ACCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTT 
TTTCTGATTATGAATCCTATATTC 
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Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ ID NO. 1307: SAG0466 FROM THE 1169NT1 GBS TYPE V STRAIN REVERSE COMPLEMENT 

CAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTT 
C AGAGTC AGAAAACAGAATTC CGGATTGTTC ACATTGTAG AAGTAGC AGGAGAT C C CAAGCTTAGTC C AGAATTGGTTC A 
CACGGCTACGGAAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 
CAGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGCATTAGCTTAC 
GGAC AC C C TTATGC CTGCTCAGGAATTATTAATATC CTTC ATCTTATGC AGG CATTAAAATATAAAAATAAAC CTATGGG 
CCTAACTGCCATTGCAGGGGCA 

SEQ ID NO. 1308: SAG0466 FROM THE 18RS21 GBS TYPE II STRAIN 

CCTTAACAGTTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGCGAGATCAAGGCGTTAGAAAA 
CTAAAAGAAACATTTTTTCAAAAATTACCAAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAAT 
GCACGATGC TGC TGCATTTCTAACGC TTC AGAGTC AGAAAACAGAATT CCGGATTGT TCACATTGTAGAAGTAGCAGGAG 
ATC C C AAGCTTAGTCCAGAATTGGTTC ACACGGCTACGGAAAAACTATTAACAGAAACTCATAC TAAAATATCGGATTAT 
GATGCAATTGAATGGAATGAACC ATTTGCAGCGATTGATGCTC TATTTAAT CATTATTAT C C TGAAGAGAGAGAAAAATT 
CAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTATTAATATCCTTCATCTTATGCAGG 
CATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCATTGCAGGGGCAG 

SEQ ID NO. 1309: SAG0466 FROM THE 18RS21 GBS TYPE II STRAIN 

TCGGTATAAAAGGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAATAAAAAAAATA 
GAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCT 
TTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTG 
GTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGA 
CGTTATGCTAAAGAAGATAATCGTAACGGAGAATATACAGTTGCTCAGTTTTCTCCTGACTCTTATGCTGAAACTGTAAT 
GTTAGAAGGGGCCCAGAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGAGCCATA 

AACGCGCCTTAACAGCTAAACA 

SEQ ID NO. 1310: SAG0466 FROM THE H36b GBS TYPE lb STRAIN 

TTTGGGCT ACGAAC AC CTATCGGTATAAAAGGGAAGC AATTTAAACATTAC CGTC C AGAACTTTTAGGAGCACAC C TTTT 
AAATCAAATAAAAAAAATAGAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATA 
TTGGTCGTTTGATGACTCTT TTTT CTGATTATGAAT C C TAT ATTC C AGTACAAACGATTGATATGCAGTGTGCTT CATCA 
AGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAG 
TTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATATACAGTTGCTCAGTTTTCTCCTGACT 

CTTATGCTGAAAC TGTAATGTTAGAAGGGGCC C 

SEQ ID NO. 1311: SAG0466 FROM THE H36b GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

GAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAA 
AACAGAATTCCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGG 
AAAAACTATTAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGAT 
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTA 
TGCCTGCT CAGGAATTATTAATATCCTT C ATC T TATGC AGGC ATTAAAATATAAAAATAAAC CTATGGGTCTAACTGCCA 
TTGCAGGGGCAGGA 

SEQ ID NO. 1312: SAG0466 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATT 
CCGGATTGTTC ACATTGTAGAAGTAGC AGGAGATC CCAAGC TTAGTC C AGAATTGGTTC AC ACGGCTACGGAAAAACTAT 
TAACAGAAACTC ATAC TAAAATATCGGATTATGATGC AATTGAATGGAATGAAC C ATTTGCAGCGATTGATGCT TTATTT 
AAT CAT TAT TAT C C TG AAG AG AG AG AAAAATT C AAT AT TTTT GGAGGGG C AT TAG C T T ACGGAC AC C C T T AT G C C T G C T C 
AGGAATTATTAATAT C CTTCATCTTATGCAGGC ATTAAAATATAAAAATAAACCTATGGGTTCTAAGTGC 

SEQ ID NO. 1313: SAG0466 FROM THE M781 GBS TYPE III STRAIN 

GCAATTTAAAC ATTAC CGTCC AGAAC TTTTAGGAGCAC AC CTC TTAAATCAAATAAAAAAAATAGAATC AGAAT CTAAT A 
TTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATATTGGTCGTTTGATGACTCTTTTTTCTGATTATGAA 
TCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAG 
TGCCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAG 
ATAATCGTAACGGAGAATATAC CGTTG C T C AGTTT TCTCCTGACTCTTATGCTGAAACTGTAATGT TAGA 

SEQ ID NO 1314: SAG0466 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATT 
CCGGATTGTTCACATTGTAGAAGTAGCAGGAGATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTAT 
TAACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTCTATTT 
AATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGACATTAGCTTACGGACACCCTTATGCCTGCTC 
AGGAATTATTAATATCCTTCATCTTATGCAGGCATTAAAATATAAAAATAAACCTATGGGTCTAACTGCCATTGCAGGGG 

C 
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Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ ID NO.- 1315: SAG0466 FROM THE JM9130013 GBS TYPE VIII STRAIN REVERSE 
COMPLEMENT 

GCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGA 
TTGTTCACATTGTAGAAGTAGC AGGAGAT C CCAAGCTTAGT CCAGAATTGGTT CACACGGCTACGGAAAAACTATTAAC A 
GAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGATGCTCTATTTAATCA 
TTATTATC CTGAAGAGAGAGAAAAATTCAATATT TTTGGAGGGGCATTAGCTTACGGAC AC CCTTATGC CTGCTC AGGAA 
TTATTAATATC CTTCATCTTATGC AGGCATTAAAATATAAAAATAAAC CTATGGGT CTAACTGCCATTGCAGGGGCAGGA 

SEQ ID NO. 1316: SAG0466 FROM THE JM9130013 GBS TYPE VIII STRAIN 

TT TGGGC TACGAAC AC CTAT CGGTATAAAAGGGAAGCAATTTAAAC ATTAC CGTCC AGAACTTTTAGGAGCAC AC CTTTT 
AAATCAAATAAAAAAAATAGAATCAGAATCTAACATTGATAGTATTATTTGTGGGAACACAGTTGGTACTGGGGGCAATA 
TTGGTCGTTTGATGACTCTTTTTTCTGATTATGAATCCTATATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCA 
AGTTCTGCCTTGTTTTTTGGTTATCTAAAAATCAGTACCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAAAGTAG 
TTC TCTTC AAC CTATGAGACGT TATGCTAAAGAAGATAATCGTAACGGAGAATATA 

SEQ1301 CTCCTGCCCCTGCAATGGCAGTTAGACCCATAGGTTTATTTTTATATTTTA 

SEQ1302 -~ 

SEQ1303 

SEQ1304 

SEQ1305 

SEQ1306 

SEQ1307 . 

SEQ13 0 8 CTTAACAGTTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAATGGAAGGGATGC 

SEQ1309 

SEQ1310 

SEQ1311 

SEQ1312 

SEQ1313 " 

SEQ1314 

SEQ1315 

SEQ1316 



SEQ1301 TGC CTGC ATAAGATGAAGGATATTAATAATTCC TGAGC AGGC ATAAGGGTGTC CGTAAG 

SEQ13 02 - r TCGGTATAAA 

SEQ1303 

SEQ13 04 ATCGGTATAAA 

SEQ13 05 TTT TC AAAAATTAC CAAGATTGATGG 

SEQ1306 GGTATAAA 

SEQ1307 CAAGATTGATGG 

SEQ13 0 8 AGATCAAGGCGTTAGAAAACTAAAAGAAACATTTTTTCAAAAATTACCAAGATTGATGG 

SEQ130 9 TCGGTATAAA 

SEQ1310 TTTGGGCTACGAACACCTATCGGTATAAA 

SEQ1311 G 

SEQ1312 ~ 

SEQ1313 

SEQ1314 

SEQ1315 

SEQ1316 TTTGGGCTACGAACACCTATCGGTATAAA 

SEQ13 0 1 TAATGTCCCTCCAAA - AATATTGAATTTTTCTCTCTC - TTCAGGATAATAATGATTAAA 

SEQ13 0 2 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT 

SEQ1303 

SEQ1304 GGGAAGCAATTTAAA-ATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT 

SEQ13 0 5 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ13 0 6 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT 

SEQ 1 3 0 7 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ13 0 8 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ13 0 9 GGGAAGCAATTTAAACATTACCGT CC AGAACTTTTAGGAGCAC AC CTT TTAAATCAAAT 

SEQ131 0 GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAAT 

SEQ1311 AAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ13 12 CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ1313 GCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTCTTAAATCAAAT 

SEQ1314 CCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 

SEQ1315 GCTCACTATTGGAAATGTTTGTTTAATGCACGATGCTGCTGCATTTC 
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Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ1316 



GGGAAGCAATTTAAACATTACCGTCCAGAACTTTTAGGAGCACACCTTTTAAATCAAAT 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ13 0 6 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ13 08 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



AGAGCATCAATCGCTGCAAATGGTTCATTCC-ATTCAATTGCATCATAATCCGATATTT 

AAAAAAATAGAATCAGAATCTAAT- - ATT GATAGTATTATTTGTGGGAACA- CAGT 

TTGTGGGAACA- CAGT 



AAAAAAATAGAATCAGAATCTAAT - 
AACGCTTCAGAGTCAGAAAACAGA - 
AAAAAAATATAACC AGAATC TAAC • 
AACGCTTCAGAGTCAGAAAACAGA- 
AACGCTTCAGAGTCAGAAAACAGA- 
AAAAAAATAGAATC AGAATCTAAC • 
AAAAAAATAGAATCAGAATCTAAC - 
AACGCTTCAGAGTCAGAAAACAGA - 
AACGCTTCAGAGTCAGAAAACAGA - 
AAAAAAATAGAATC AGAATCTAAT - 
AACGCTTCAGAGTCAGAAAACAGA - 
AACGCTTCAGAGTCAGAAAACAGA - 
AAAAAAATAGAATCAGAATCTAAC - 



•ATT GATAGTATTATTTGTGGGAACA- CAGT 

•ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 
■ATT - - -GATAGTATTATTTGTGGGAACA- CAGT 

• ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 
•ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 

. ATT GAT AGTATT AT TTGTGGGAACA - CAGT 

. ATT GATAGTATTATTTGTGGGAACA - CAGT 

- ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 

• ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 

-ATT GATAGTATTATTTGTGGGAACA- CAGT 

-ATTCCGGATTGTTCACATTGTAGAAGTAGCAGG 

• ATTC CGGATTGTTC ACATTGTAGAAGTAGC AGG 
-ATT GATAGTATTATTTGTGGGAACA- CAGT 



AGTATGAGTTTCTGTTAATAGTTTTTCCGTAGCCGTGTGAACCAATTCTGGACTAAGCT 
GGTAC TGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA - - 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA - - 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA - - 
GATCC CAAGC TTAGTCCAGAATTGGTTC AC ACGGCTACGGAAAAACTATTAACAGAAAC 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA- - 
GATC CCAAGCTTAGTCCAGAATTGGTTCAC ACGGC TACGGAAAAACTATTAACAGAAAC 
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA - - 
GGTACTGGGGGCAATATTGG- TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA- - 
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC 
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA- - 
GATCCCAAGCTTAGTCCAGAATTGGTTCACACGGCTACGGAAAAACTATTAACAGAAAC 
GAT C C C AAG C T T AGT C C AG AAT TGGT T C AC ACGG C T ACGG AAAAAC TAT T AAC AGAAAC 
GGTACTGGGGGCAATATTGG - TCGTTTGATGACTCTTTTTTCTGATTATGAATCCTA - - 

GGGATCTCCTGCTACTTCTACAATGTGAACAATCCGGA-ATTCTGTTTTCTGACTCTGA 
TATTCC AGTACAAACGATTGATATG CAGTGTGCTTCAT CAAGT - - TCTGCCTTGTTTTT 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT - - TCTGCCTTGTTTTT 
TATTC CAGTACAAACGAT TGATATGCAGTGTGCTTC ATCAAGT - - TCTGCCTTGTTTTT 
C ATACTAAAATAT CGGATTATGATGC AATTGAATGGAATGAAC CATTTGCAGCGATTGA 

TATTC 

CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA 
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT - -TCTGCCTTGTTTTT 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT- -TCTGCCTTGTTTTT 
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA 
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT - - TCTGCCTTGTTTTT 
CATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTGCAGCGATTGA 
CAT AC TAAAATATCGGATTATGATGC AATTGAATGGAATGAAC C ATTTGC AGCGAT TGA 
TATTCCAGTACAAACGATTGATATGCAGTGTGCTTCATCAAGT - - TCTGCCTTGTTTTT 
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Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



GCGTTAGAAATGCAGCAGCATCGTGCATTAAACAAACATTTC - - CAATAGTGAGCAAAG 
GGT - TATCTAAAAATCAGTG - C CGGTATTAATGAAAAAGT TCTTGTTGGGGGGATTGAA 
GGT - TATCTAAAAATCAGTG - CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA 

GGG - TATCTAAAAA 

GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC 

GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC 
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGAC 
GGT - TATCTAAAAATCAGTA- CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA 
GGT - TATCTAAAAATCAGTA- CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA 
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGAC 
GCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC 
GGT - TATCTAAAAATCAGTG - CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA 
GCTCT ATTTAATCATTATTAT CCTGAAGAGAGAGAAAAATT CAATATTTTTGGAGGGAC 
GCTCTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTTTTGGAGGGGC 
GGT - TATCTAAAAATCAGTA - CCGGTATTAATGAAAAAGTTCTTGTTGGGGGGATTGAA 

TGAATTTTCCATCAATCTTGG--TAATTTTTGAAAAAATGTTTCTTTTAGTTTTCTAAC 
GTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT 
GTAGTTCTCTTCAACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT 



TTAGCTTACGGACACCCTTAA- - TGCCTGCTCAGGAATTATTAATATCC 

TTAGCTTACGGACACCCTTA TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT 

TTAGCTTACGGACACCCTTA - - - TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT 
GTAGTTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATAT 
GTAGTT CTCTTC AAC CTATGAGACGTTATGC TAAAGAAGATAATCGTAACGGAGAATAT 
TTAGCTTACGGACACCCTTA- - -TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT 
TTAGCTTACGGACACCCTTA- - -TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT 
GTAGTT CT CTTC AACCTATGAGACGTTACGCTAAAGAAGATAATCGTAACGGAGAATAT 
TTAGCTTACGGACACCCTTA- - -TGCCTGCTCAGGAATTATTAATATCCTTCATCTTAT 
TTAGCTTACGGACACCCTTA- - - TGCC TGCT CAGGAATTATTAATATC CTTCATCTTAT 
GTAGTTCTCTTCAACCTATGAGACGTTATGCTAAAGAAGATAATCGTAACGGAGAATAT 

CCTTGATCTCGCATCCCTTCCATTGGTAAGATTACYTCTTCTAAATAGCCACCTTGTTT 
CCGTTGCTCAGTTTTCTCCTGACTCTTATGCTG - - AAAC TGTAATGTTAGAAGGGGCAC 
CCGTTGCTCAGTTTTCTCCTGACTCTTAKGCTG - - AAACTGTAATGTTAGAAGGGGCAC 

C AGGCATTAAAATATAAAAATAAAC C TATGGGC - CTAACTGCCATTGCAGGGGCA 

CAGGCATTAAAATATAAAAATAAACCTATGGGT - CTAACTGCCATTGCAGGGGCAG- - - 
CAGTTGCTCAGTTTTCTCCTGACTCTTATGCTG - - AAACTGTAATGTTAGAAGGGGCCC 
CAGTTGCTCAGTTTTCTCCTGACTCTTATGCTG- -AAACTGTAATGTTAGAAGGGGCCC 
CAGGC ATTAAAATATAAAAATAAACC TATGGGT - C T AAC T G C CAT T GC AGGGG C AGG A - 

C AGGCATTAAAATATAAAAATAAAC CTATGGGTT CTAAC TGC 

CCGTTGCTCAGTTTTCTCCTGACTCTTATGCTG--AAACTGTAATGTTAGA 

CAGGC ATTAAAAT AT AAAAATAAAC CTATGGGT - CTAACTGCCATTGCAGGGGC 

C AGG CAT TAAAAT AT AAAAAT AAAC C TATGGGT -CTAAC TGC CAT TGC AG GGG C AGG A - 

TABCMARATVSTNCSRATNGTSAGTHAS 
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Table 13: Comparative Sequences relating to SAG046 6 (thiolase) 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



GCTGTTAAGGCGCGTTTATGGCTCAAGAATGCCAATTTATCTAACATTTCTCTTCTAAA 
AAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGA 
AAGAGT CTGTC AAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATT CTTGA 



GAGAGTCTGTCAAAAATATGGTTTTAGAAGAGAAATGTTAGATAAATTGGCATTCTTGA 



CCATATTTTTGACAGACTCTCTGGGCCCCTT - - CTAACATTACAGTTTCAGCATAAGAG 
CCATAAACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAA 
CCATAAACGCGCCTTAACAGCTAAACAAGGTGGCTATTTAGAAGAGGTAATCTTACCAA 



CCATAAACGCGCCTTAACAGCTAAACA- 



CAGGAGAAAACTGAGCAACTGTATATTCTCCGTTACGATTATCTTCTTTAGCATAACGT 
GGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAAATTAC 
GGAAGGGATGCGAGATCAAGGCGTTAGAAAACTAAAAGAAGCATTTTTTCAAAAATTAC 
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Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ13 02 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



TC ATAGGTTGAAGAGAACTACTTTCAAT C C C C CC AAC AAGAACTTTTTC ATTAATAC CG 
AAGATTGATGGAAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATG 
AAGATTGATGGRAAATTCACCTTTGCTCACTATTGGAAATGTTTGTTTAATGCACGATG 



T AC TG AT T T T TAG AT AAC C AAAAAAC - - AAGG C AG AAC TTGATGAAGC AC ACTGCATAT 
TGCTGCATTTCTAACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAG 
TGCTGCATTTCTWACGCTTCAGAGTCAGAAAACAGAATTCCGGATTGTTCACATTGTAG 



AATCGTTTGTACTGGAATATAGGATTC ATAAT C AGAAAAAAGAGT CATCAAACGACCAA 
AGTAGC AGGAGAT CC C AAGCTTAGT CC AGAATTGGTTCACACGGC T ACGGAAAAACTAT 
AGTAGC AGGAGATC C CAAGCT TAGTC CAGAATTGGTT C ACACGGCTACGGAAAAACTAT 



532 



WO 2004/018646 



PCT/US2003/026827 



Table 13: Comparative Sequences relating to SAG0466 (thiolase) 



SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ130 6 
SEQ1307 
SEQ13 08 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 

SEQ1301 
SEQ1302 
SEQ1303 
SEQ1304 
SEQ1305 
SEQ1306 
SEQ1307 
SEQ1308 
SEQ1309 
SEQ1310 
SEQ1311 
SEQ1312 
SEQ1313 
SEQ1314 
SEQ1315 
SEQ1316 



ATTGCCCC C AGT ACCAACTGTGTTC C CACAAATAATACTAT C AATGTTAGATTCTGATT 
AACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 
AACAGAAACTCATACTAAAATATCGGATTATGATGCAATTGAATGGAATGAACCATTTG 

TATTTTTTTTATTTGATTTAAAAGGTGTGCTCCTAAAAGTTCTGGACGGTAAGTTTAAA 
AGCGATTGATGCTTTATTTAATCATTATTATC C TGAAGAGAGAGAAAAATT C AATATTT 
AGCGATTGATGCTTTATTTAATCATTATTATCCTGAAGAGAGAGAAAAATTCAATATTT 

TGCTT 

TGGAGGGGCATTAGCTTACGGACACCCTTATGCCTGCTCAGGAATTA 

TGGAGGGGCATTAGCTTACGGACACC C TTATGC CTGCTCAGG 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ ID NO. 1401: SAG0471 FROM THE 18RS21 GBS TYPE II STRAIN 

TTAAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAAGAAAAATGKSGC^ 

TCTGATATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATT^ 

AGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCT 

AAGTTGGAATTCCATTTTTTATTGATAACGATGCTAATGTTGCAGCACT^ 

GTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGTTATCGCAG^ 

AATTGGGCATATGATTGTTGATCCAGAAAATGGATTTACGTGCACA 

GTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAGGX5TTCGTCTGCCATTAAAGCA 

AGTAAAGATATTTTTATAGCAGC^GAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTACCTTGGACTGGCAGC 
AGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTG 
AGAAATACTTTGTC^CATTTGCTTTCCC^CAAGTTAAAAAGTCMCTAAAATTAAGAT 



SEQ ID NO. 1402: SAG0471 FROM THE 090 GBS TYPE la STRAIN 

CGTTTCTGATATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTA 
CAGGAGCTGTTGATAGAACTAGTAAAACAGTAAGAGGTGCTTT^ 

AAAGAAGTTGGAATTCCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGCTGGTG 
• CGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGTTATCGCAGAT^ 
GAGAAATTGGGCATATGATTGTTGATCGAGAKAATGGATTTAC^ 
ACAGGTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAAT^ 

TACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTACCTTGGACTGG 

GAGCAGCTAATATTTCAAATATTTTAAACCCTGATTC^ 

GTTGAGAAATACTTTGTCACATTTG 

SEQ ID NO. 1403: SAG0471 FROM THE COH1 GBS TYPE la STRAIN 

ACAAGAAAAATGGGCAATTGAGACCAATACTTTAGA 

GCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAA 
GCTTTTAATCTAAATTGGGCTGATACTCAAGA 

SEQ ID NO. 1404: SAG0471 FROM THE CJB110 GBS NONTYPEABIiE STRAIN 

TTGGTATCTTGACGCTTGAGGAGAAGTACAAGAAAAAT 

TCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATG 
GATAGAACTAGTAAAAC 

SEQ ID NO. 1405: SAG0471 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

CACCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTG 
GTTGAGAAATACTTTGTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTA 

SEQ ID NO. 1406: SAG0471 FROM THE 2603V/R GBS TYPE V STRAIN 

GGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGA 
TTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTG 

SEQ ID NO. 1407: SAG0471 FROM THE H36b GBS TYPE lb STRAIN 

GGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATCGTTGAATCTCTCA 
TAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAG 

AATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGATGCTAATG 
ACTTGGTGAACGCTGGGTAGGTGCTGGTGCCAATAATCCCGACGTTGTTTTCGTAACC 

SEQ ID NO. 1408: SAG0471 FROM THE H36 GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

GAGACAGTTGCATCAGCGACAGGTGTTGTTAGAGTAGCACG 

TGACAACGGTGATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCT 

CACGTTACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTGGCGGTGTCTCAGC^ 
GAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCACATTTGCTTTCCCACA 

SEQ ID NO. 1409: SAG0471 FROM THE M732 GBS TYPE III STRAIN 

ACAAGAAAAATGGGCAATTGAGACCATACTTAGAAAACGG 
CTCTATGGATTAACAAAAGATGACTTTCTCGGTATC 

TTTTAATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGATGCTA 

ATGTTGGAGCACTTGGTGAACGGTGGGTAGGTGCTGGTGCCAATA 

GGTGTTATCGCAGATGGTAACCTCATCCATGGTGTTGCAAGAGC^^ 

SEQ ID NO. 1410: SAG0471 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTC^ 
ATTGCTGAACTAGGTAATGAT 

SEQ ID NO. 1411: SAG0471 FROM THE M781 GBS TYPE III STRAIN 

AGAAGTACAAGAAAATGG^CAATTGAGACC!ATACTTAGAA 

TGAGCCT CTATGGATTAACAAAAGATGACTTTCT CGGTAT CGGTATGGGT TCT C CAGGAGCTGTTGATAGAACTAGTAAAACAGTAACA 
GGTGCTTTTAATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAATTCCATTTTTTATTGATAACGA 
TGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGCTGGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTA 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ ID NO. 1412: SAG0471 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATA^ 

TGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTG 

GTAGTCGCGTTGAGAAATACTTTGTCACATTTGCTTTCCCACAAGTTAAAAA 

SEQ ID NO. 1413: SAG0471 FROM THE 090 GBS TYPE la STRAIN 

AAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAAGAAAAAT^ 

TATCGTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATGGGT 
TTGATAGAACTAGTAAAACAGTAACAGGTGCTTTTAATCTAAATTGGGCTGAT^ 
GGAATTCCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGC^ 
TTTCGTAACC CTCGGAACAGGAGTAGGTGGAGG 

SEQ ID NO. 1414: SAG0471 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

GTGATACTGTTACAAGTAAAGATATTTTTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTAC 

CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTG^ 

ACGTAGTCGCGTTGAGAAATACTTTATCACATTTGCTTTCCCACAAGTT^ 

SEQ ID NO. 1415: SAG0471 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GTTATCGCAGATGGTAACCTCATCCATGGTGTTGCAGG^ 

GTGCACATGTGGTAACAAAGGCTGCCTTGAGACAGTTGC^ 

AGGGTTCGTCTGCCATTAAAGCAGCGATTGACCACGGTGATACTGTTAC^ 

TTTGCTAATTCTGTTGTTGAACGTGTATCACGTTACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGT 

TATTGGTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGT^ 

AGTCAACTAA 

SEQ ID NO. 1416: SAG0471 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

TGGTATCTTGACGCTTGAGGGAGAAGTACAAGAAAAATGGGCAATTGAGACCATACTTAGAAAACGGAAGA 

GTTGAATCTCTCAAACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGGTATCGGTATGGGTTCTCCAGGAG 

TAGAACTAGTAAAACAGTCACAGGTGCTTTTAATCTAAATTGGGCTGATACTCAAGAAGTA 

TTCCATTTTTTATTG 

SEQ ID NO. 1417: SAG0471 FROM THE 2603V/R TYPE V GBS STRAIN (REVERSE COMPLEMENT) 

AGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGGTG 
TTGAGAAATACTTTGTCACATTTGTTT TCCCACAAGGT 



SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ14 15 T T AT CG CAGATGGTAAC CTCAT C CATGGTGT TG CAGGAGCAGGTGGAGAAA.T TGGG CAT 

SEQ1416 

SEQ1417 



SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 GAG 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ 14 14 ~- 

SEQ14 15 TGATTGTTGATCCAGAAAATGGATTTACGTGCACATGTGGTAACAAAGGCTGCCTTGAG 

SEQ1416 

SEQ1417 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ1401 

SEQ1402~ 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



C^GTTGCATC^GCGACAGGTGTTGTTAGAGTAGC^CGTCAACTCGCAGAACAATATGAG 



C^GTTGCATC^GCGACAGGTGTTGTTAGAGTAGCACGTCAACTCGCAGAACAATATGAG 



SEQ1401_ 

SEQ1402" 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQX407 

SEQ14 08 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



GTTCGTCTGCCATTAAAGCAGCGATTGACAACGGTGATACTGTTACAAGTAAAGATATT 

GATACTGTTACAAGTAAAGATATT 

GTGATACTGTTACAAGTAAAGATATT 

GTTCGTCTGCCATTAAAGCAGCGATTGACCACGGTGATACTGTTACAAGTAAAGATATT 



SEQ14 01_ TTAAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAA 

SEQ1402 

SEQ1403 ACAA 

SEQ1404 TTGGTATCTTGACGCTTGAGG-AGAAGTACAA 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ14 0 8 TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 

SEQ1409 r ACAA 

SEQ1410 

SEQ14 11 AGAAGTACAA 

SEQ14 12 TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 

SEQ14 13 AAATTTGGTATCTTGACGCTTGAGGGAGAAGTACAA 

SEQ14 14 TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 

SEQ14 15 TTATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGT 

SEQ1416 TGGTATCTTGACGCTTGAGGGAGAAGTACAA 

SEQ1417 -~ 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ14 0 1_ AAAAATGGGCAATTGAGACCAATAGTTTAGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 02 CGTTTCTGATATC 

SEQ14 0 3 AAAAATGGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 04 AAAAATGGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 05 CACCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ1406 GGGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 07 GGCAATTGAGACCAATACTTTAGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 0 8 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ140 9 AAAAAT G GG CAATTGAGACCA- TACT T-AGAAAACGGAAGA CAT AT CGTTTCTGATATC 

SEQ1410 

SEQ14 11 AAAA - TGGGCAATTGAGACCA - TACTT - AGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 12 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ14 1 3 AAAAATGGGCA - TTGAGACCA - TACTT - AGAAAACGGAAGACATATCGTTTCTGATATC 

SEQ14 14 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ14 1 5 ACCTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ141 6 AAAAATGGGCAATTGAGACCA - TACTT - AGAAAACGGAAGACATAT CGT T T CTGATAT C 

SEQ1417 AGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATT 

SEQ14 0 1_ TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 0 2 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 0 3 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 0 4 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 0 5 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 0 6 TTGAATCTCTCIA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 07 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 0 8 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 0 9 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 10 CAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 11 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 12 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 13 TTGAATCTCTCA-AAC^TCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 14 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 15 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ1416 TTGAATCTCTCA-AACATCGTTTGAGCCTCTATGGATTAACAAAAGATGACTTTCTCGG 

SEQ14 17 GTGGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTT 

SEQ14 0 1_ ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 0 2 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ1403 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 04 ATCGGTATGGGGTCTCCAGGAGCTGTTGATAGAACTAGTAAAAC 

SEQ14 0 5 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTA 

SEQ14 0 6 ATCGGTATGGGTTCTCCAGGAGCTG 

SEQ14 0 7 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 0 8 GTCACATTTGCTTTCCCACA 

S EQ14 0 9 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 10 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGATTGCTGAACTAGG 

SEQ14 11 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 12 GTCACATTTGCTTTCCCACAAGTTAAAAA 

SEQ14 13 ATCGGTATGGGTT C TCCAGGAGCTGTTGATAGAACTAGTAAAACAGTAACAGGTGCTTT 

SEQ14 14 ATCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGATTG 

SEQ14 15 GTCACATTTGCTTTCCCACAAGTTAAAAAGTCAACTAA 

SEQ14 1 6 ATCGGTATGGGTTCTCCAGGAGCTGTTGATAGAACTAGTAAAACAGTCACAGGTGCTTT 

SEQ1417 GTCACATTTGTTTTCCCACAAGGT 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SKQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402" 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 
AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 
AATCTAAATTGGGCTGATACTCAAGA 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 

AATGAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCGGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGTTGGAAT 

AATCTAAATTGGGCTGATACTCAAGAAGTAGGTTCAGTTATTGAAAAAGAAGCTGGAAT 

CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 

CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 
CCATTTTTTATTGATAACGATGCTAATGTTGCAGCACTTGGTGAACGCTGGGTAGGTGC 

CCATTTTTTATTG 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 
GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACC 

GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGGTGT 

GGTGCCAATAATCCCGATGTTGTTTTCGTAACCCTCGGAACAGGAGTA 

GGTGCCAATAATCCCGACGTTGTTTTCGTAACCCTCGGAACAGGAGTAGGTGGAGG 
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SEQ1401 
SEQ1402 
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ1409 
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 

SEQ1401 
SEQ1402 
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ1409 
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 

SEQ1401 
SEQ1402 
SEQ1403 
SEQ1404 
SEQ1405 
SEQ1406 
SEQ1407 
SEQ1408 
SEQ140 9 
SEQ1410 
SEQ1411 
SEQ1412 
SEQ1413 
SEQ1414 
SEQ1415 
SEQ1416 
SEQ1417 



Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



ATCGCAGATGGTAACCT(^TCCATGGTGTTGCAGGAGCAGGTGGAGAAATTGGGCATAT 
AT CG CAGATGGTAA C CT CAT C CATGGTGT TG CAGGAG CAGGTGGAGAAAT T GGG CATAT 



AT CG CAGATGGTAAC CT CAT C CAT GGTGT TG CAAGAG CAGGTGGAGAAATTGGG CAT AT 



ATTGTTGATCGA.GAAAATGGATTTACGTGCACATGTGGTAACAAAGGCTGCCTTGAGAC 
ATTGTTGATCGAGAKAATGGATTTACGTGCACATGTGGTAACAAAGGCTGTCTTGAGAC 



ATT 



GTTGCATCAGCGACAGGTGTTGTTAGAGTAGC1ACGTCAACTCGCAGAACAATATGAGGG 
GTTG(^TCAGCGACAGGTGTTGTTAaA.GTAGC^CGTCAACTCGCAGAACAATATGAAGG 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ1401_ 

SEQ1402 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ14 01 

SEQ1402" 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ140 6 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



TCGTCTGCCATTAAAGCAGCGATTGACACCGGTGATACTGTTACAAGTAAAGATATTTT 
TCGTCTGCCATTAAAGCAGCGATTGACAACGGTGATACTGTTACAAGTAAAGATATTTT 



ATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTA 
ATAGCAGCAGAAGATGGGGATAAATTTGCTAATTCTGTTGTTGAACGTGTATCACGTTA 



SEQ1401 

SEQ1402" 

SEQ14 03 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ140 8 

SEQ14 0 9 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 



CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGG 
CTTGGACTGGCAGCAGCTAATATTTCAAATATTTTAAACCCTGATTCTGTGGTTATTGG 
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Table 14: Comparative Sequences relating to SAG0471 (glucokinase) 



SEQ1401_ 

SEQ14 02 

SEQ1403 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ1409 

SEQ1410 

SEQ1411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ1417 

SEQ1401_ 

SEQ1402 

SEQ14 03 

SEQ1404 

SEQ1405 

SEQ1406 

SEQ1407 

SEQ1408 

SEQ140 9 

SEQ1410 

SEQX411 

SEQ1412 

SEQ1413 

SEQ1414 

SEQ1415 

SEQ1416 

SEQ14X7 



GGCGGTGTCTCAGCAGCAGGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCAC 
GGCGGTGTCTCAGCAGCA.GGTGAATTTTTACGTAGTCGCGTTGAGAAATACTTTGTCAC 

— — — ~, — — — — — — — — — — — — — — — — — — — — — 

TTTGCTTTCCCACAAGTTAAAAAGTCAACTAAAATTAAGAT 
TTTG 
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Table 15: Comparative Sequences relating to SAG0492 



SEQ 10 NO. 1501: SAG0492 FROM THE 1169NT1 GBS NONTYPEABLE STRAIN 

TGACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTGGC^ 
TGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGAATTGATAT^ 
GGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGA 
TAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAA 

GAGGACZAACAACAACGGATTGCTATTGCAAGAGGTCT 

CCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTT 
TGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGATGCA^ 

SEQ ID NO. 1502: SAG0492 FROM THE 18RS21 GBS TYPE II STRAIN 

TTGGGAAAAATGAGGTTTTAAAAGGCATTGACTTGGATATTCAT 

TCAACATTTTTAAGAAC^^TGAATCTCTTGGAAGTACCA 

TGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTT 

TATCACCTATTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCT^ 
GCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAACAACGAATTGCTATTGCAAGAGGT 

TTTTGATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGA 
TGGTTATTGTCACTC^TGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGACGCAGAAATTAT 

SEQ ID NO. 1503: SAG0492 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

AAAAATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTA 

ATTTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTG^ 

TTTTTAAAATGCGCGAAAAAATGGGC^TGGTTTTTC^ACAG 

CCTATTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACA^ 

TACTTATCCAGCTAGCTTATCTGGAGGACAACAACAACG 

ATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTT 
ATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATT^ 

SEQ ID NO. 1504: SAG0492 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GAGGTTTTAAAAGGCATTGACTTGGATATTCAT 

AAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAAC^GTGACTT 

AAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAG 
AAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCA 

TCCAGCAAGCTTATCTGGAGGACAACAACAACGGAT^ 

CTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTC 
ACTCATGAAATGGGTTTTGCA.CGTGAAGTAGCGGATCGTGTC^TTTTTATGGATGaAGGGATTATTGTTGAGC^GGGACCCCTAAGAA 

AGTAT 

SEQ ID NO. 1505: SAG0492 FROM THE 090 GBS TYPE la STRAIN 

TGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCA^ 

ACTTTTGAAGGGATTGATATAAC^GAC^AAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCA 

ATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTATT^ 

ACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAG<3CTAATGC 

GCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAG 

TGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCG 

TTTTTATGGATGCAGGCATTATTGTTgAsCAAGGGACCCCTAAGGAAGTA 

SEQ ID NO. 1506: SAG0492 FROM THE A909 GBS TYPE la STRAIN 

CAATACAAGGACTTCATAAAAGTTTTGGGAAAAATGAGGTTTTA 

ATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAAC^ 

GATTGATATAACAGACAAAAAGAATGATATTTTTA^ 

TGACTGTACTAGAAAATATTACTTTATCACCTATTAAGACAAAGGGGCTTTCTAAGCTrGATGCTCAGACAAAAGCATATGAGCTACTT 
GAAAAAGTTGGACTCAAAGAGAAGGCTAATACTTATCC^GCTAGCTTAT^ 

TGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCMG 

ATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGG^ 

GCAGGAATTATTGTgAGCAAGGGGCCCCTAAGGAAGTATTTGAGCAGACAAAAGA 

SEQ ID NO. 1507: SAG0492 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT) 

GACTTGGATATTCATC1AAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAATGAATCTCTT 

GGAAGTAC CAACAAAGG GAA CAGT GAC T T T TGAAGGGAT T GATAT AACAGACAAAAAGAATGATAT T T T TAAAATGC G CGAAAAAAT GG 

GCATGGTTTTTCAAGAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACT 

AAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAAAA 

AGGACAAGAACAACGAATTGCTATTGGAAGAGGTCTTGGAA 

CTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAGCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTT 
GCACGTGAAGTAGCGGATCGTGTCTTTTTATGGATGCGGGAATTATTGTGAGCAAGGGACC 
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Table 15: Comparative Sequences relating to SAG0492 



SEQ ID NO. 1508: SAG0492 FROM THE H3 6b GBS TYPE lb STRAIN 

ATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTAGTGGTTATTATTG 

TTAAGAACAATGAATCTCTTG4IAAGTAC 

TAAAATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAA 

TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAA^ 

TATCCAGCTAGCTTATCTGGAGGACAACAACAACGAATTGCT^ 

ACCTACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGC!AAGATTTAGCTAAATCTGGTATGACGATGGTTATTG 
TCACTCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGATGCMGAATTATTGTTGAGCAAGGGGCCCCTAAG 

GAAGTAT 

SEQ ID NO. 1509: SAG0492 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAA 

GAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGAC 

ATGCGCGAAAAAATGGGCATGGTTTTTCAACAGTTC^TCTATTTCCC^TAT^ 

GACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAAAAGTT 

CAGCTAGCTTATCTGGAGGACAACAACAACGAATTGCTATTGCA^ 

ACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATT^ 

TCATGAAATGGGTTTTGCACGTGAAGTAGCGGATCGTGTCATTTTTATGGATGCAGGAATTATTGTTGAGCAAGGG 
TATTTAGCAAAACAAAAGAAAT 

SEQ ID NO. 1510: SAG0492 FROM THE M732 GBS TYPE III STRAIN 

GGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAATGAAT 
CTTTTGAAGGGATTGATATAACAGACAAAAAGAATGATATTTTTAAAA 
TTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTATTAAGAC^ 
CGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGG 

SEQ ID NO. 1511: SAG0492 FROM THE COH1 GBS TYPE la STRAIN 

ATTGACTTGGATATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTC 

CTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTGATATAACAGACAAAAAGAATGATATTTTTA 

TGGGCATGGTTTTTCAACAGTTCAATCTATTTCCCAATATC^ 

TCTAAGCTTGATGCTCAGACAAAAGCATACGAGCT^ 

TGG 
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Table 15: Comparative Sequences relating to SAG0492 

SEQ1501 TGACTTGG 

SEQ1502 TTGGGAAAAATGAGGTTTTAAAAGGCATTGACTTGG 

SEQ150 3 AAAAATGAGGTT TTAAAAGGCATTGAC TTGG 

SEQ1504 GAGGTTTTAAAAGGC ATTGACTTGG 

SEQ1505 

SEQ150 6 AATACAAGGACTTCATAAAAGTTTTGGGAAAAATGAGGTTTTAAAAGGCATTGACTTGG 

SEQ1507 GACTTGG 

SEQ150 8 - ATGAGGTTTTAAAAGGCATTGACTTGG 

SEQ15 0 9 GGTTTTAAAAGGCATTGACTTGG 

SEQ1510 

SEQ1511 - ATTGACTTGG 

SEQ15 0 1 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ1502 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

S EQ1 5 0 3 TATTC ATCAAGGi\GAAGTAGTGGTTATTATTGGC C C TTC TGGCTCTGGTAAGTCAAC AT 

SEQ1504 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ150 5 - ■ TGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ15 0 6 TATTC ATCAAGGAGAAGTAGTGGTTATTATTGGC CC TTC TGGCTC TGGTAAGTCAAC AT 

SEQ1507 TATTCATCAAGGAGAAGTGGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ150 8 TATTCATCAAGGAGAAGTAGTGGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACAT 

SEQ150 9 T ATT CATC AAGGAGAAGTAGTGGTTATTATTGGC CC TTC TGGCTCTGGTAAGTCAACAT 

SEQ1510 GGTGGTT ATT AT TGGCCCTTCTGGCT C TGGTAAGT C AAC AT 

SEQ1511 TATTCATC AAGGAGAAGTGGTGGTTATTATTGGC CCTTCTGGC TCTGGTAAGTCAACAT 

SEQ150 1 TTTTAAGAAC AATGAAT CT CTTGGAAGTAC CAACAAAGGGAACAGTGACTTTTGAAGGAA 

SEQ15 0 2 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ15 0 3 TTTTAAGAACAATGAATCT C TTGGAAGTACC AAC AAAGGGAAC AGTGAC TTTTGAAGGGA 

SEQ15 0 4 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ15 0 5 TTT TAAGAAC AATGAATCTCTTGGAAGTAC CAACAAAGGGAAC AGTGAC TTTTGAAGGGA 

SEQ15 0 6 TTTTAAGAACAATGAATCT C TTGG AAGTAC C AAC AAAGGGAACAGTGAC TTTTGAAGGGA 

SEQ1507 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ15 0 8 T TTTAAGAAC AATGAAT CTCTTGGAAGTACC AAC AAAGGGAAC AGTGACTTTTGAAGGGA 

SEQ15 0 9 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ1510 TTTTAAGAAC AATGAAT C TCTTGGAAGTACC AAC AAAGGGAAC AGTGACTTTTGAAGGGA 

SEQ1511 TTTTAAGAACAATGAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGA 

SEQ15 0 1 TTGATATAACAGACAAAAAAAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 0 2 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 03 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ1504 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 0 5 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 0 6 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ1507 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ1508 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 0 9 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ15 10 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ1511 TTGATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGCATGGTTT 

SEQ150 1 ' TTCAAC AGTTC AATCTATTTC C CAATATGACTGTAC TAGAAAATATT ACTTTATC AC CTA 

SEQ15 0 2 TT C AACAGT T C AATC TATTTCC CAATATGAC TGTAC TAGAAAATATTACTTTATC AC CTA 

SEQ15 0 3 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA 

SEQ150 4 TTC AAC AGTTC AATCTATTTC C CAATATGACTGTAC TAGAAAATATTACTTTATC AC C TA 

SEQ15 0 5 TT C AAC AGTT CAAT CTATTTC CC AATATGAC TGTACTAGAAAATATTAC TTTATC ACCTA 

SEQ15 0 6 TTC AAC AGTTC AATCTAT TTC C CAATATGACTGTAC T AGAAAATATTACTTTAT C AC CTA 

SEQ15 0 7 TT C AAC AGTT CAAT CTATTTCC CAAT ATGACTGT ACT AGAAAATATTACT TTATC ACCTA 

SEQ150 8 TTCAAC AGTTC AATCTATT T C CCAAT ATGACTGT ACTAGAAAATATTACTTTAT C AC C TA 

SEQ150 9 TTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA 

SEQ151 0 TTCAACAGTT.CAATCTATTTCCCAATATGACTGTACTAGAAAATATTACTTTATCACCTA 

SEQ1511 TT C AAC AGTTC AAT C TAT TT C C CAATATGACTGTAC TAG AAAATATTACTT TATC AC CTA 
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Table 15: Comparative Sequences relating to SAG0492 



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 

SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ15 0 9 
SEQ1510 
SEQ1511 

SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQX511 

SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQX511 

SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAATCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGC TTGATGCTC AGACAAAAGCATACGAGCTAC TTGAAA 
TTAAGACAAAGGGGCTTTCTAAGCTTGATGCTCAGACAAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGGC TT TCTAAGCTTGATGC TCAGAC AAAAGCATATGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 
TTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAGACAAAAGCATACGAGCTACTTGAAA 

AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGT TGG AC T C AAAG AG AAGG C T AAT AC T TAT C C AG C TAG C T T AT CTGGAGG AC AA CAAC 
AAGT TGG AC T C AAAG AG AAGG C T AATG C T TAT C C AG C AAG C T TAT C TGGAGGAC AAC AAC 
AAGT TGGAC T C AAAG AG AAGG C T AATG C T TAT C C AGC TAG C T TAT C TGG AGGG C AAC AA C 
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC 
AAGTTGGACTC AAAGAGAAGGCTAATGC TTAT C CAGCTAGC TTATCTGGAGGAC AACAAC 
AAGT TGGAC T C AAAGAGAAGG C TAAT AC T TAT C C AGC TAG C T TAT C TGGAGGAC AAC AAC 
AAGTTGGACTCAAAGAGAAGGCTAATACTTATCCAGCTAGCTTATCTGGAGGACAACAAC 

AAGTTGGACTCAAAGAGAAGGCTAATGCTTATCCAGCAAGCTTATCTGG 

AAGTTGGAC TCAAAGAGAAGGCTAATGCTTATCCAGC AAGCTTATC TGGTAB CMARATVS 

ACGGATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTCATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGGATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 
ACGAATTGCTATTGCAAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAAC 



NCSRATNGTSAG- 



TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAG CTCTTGATC CTGAAATGGTAGGTGAAGTC TTGAC TGTTATGCAAGATTTAG 
TACT TC AGCT CTTGATC CTGAAATGGTAGGTGAAGTCTTGAC TGT TATGC AAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 
TACTTCAGCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTAG 



TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
TAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCACGTGAAGTAG 
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Table 15: Comparative Sequences relating to SAG0492 



SEQ1501 
SEQ1502 
SEQ1503 
SEQ1504 
SEQ1505 
SEQ1506 
SEQ1507 
SEQX508 
SEQ1509 
SEQ1510 
SEQ1511 



GGATCGTGTCATTTTTATGGATGCAGGCATTATTGT - GAGC AAGGGAC C CCTAAGGAAG 

GGATCGTGTC ATTTTTATGGACGCAGAAATTAT '■ 

GGATCGTGTCATTTTTATGGATGCAGGAATTATTGTTGAGCAAGGGGCCC 

GGATCGTGTCATTTTTATGGATGCAGGGATTATTGTTGAGCAAGGGACCCCTAAGAAAG 
GGATCGTGTCATTTTTATGGATGCAGGCATTATTGTTGASCAAGGGACCCCTAAGGAAG 
GGATCGTGTCATTTTTATGGATGCAGGAATTATTGTGAGCAAGGGGCCCCTAAGGAAGT 

GGATCGTGTC - TTTTTATGGATGCGGGAATTATTGT - GAGCAAGGGACC 

GGATCGTGTC ATTTTTATGGATGCASGAATTATTGTTGAGC AAGGGGCC C CTAAGGAAG 
GGAT CGTGTC ATTTTTATGGATGCAGGAATTATTGTTGAGCAAGGGGCC C CTAAGGAAG 



SEQ1501 
SEQ15 02 
SEQ1503 
SEQ1504 
SEQI505 
SEQ1506 
SEQ1507 
SEQ1508 
SEQ1509 
SEQ1510 
SEQ1511 



AT- 



AT- 
A- 



TTTGAGCAGACAAAAGAAATCCGCACAAGAGATTTCTT 



AT- 



ATTTAGCAAAACAAAAGAAAT • 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ ID NO. 1601: SAG0767 FROM THE M781 GBS TYPE III STRAIN 

TGGTCGCTCTGTCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTT 
GTTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA 
GTTAATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCC 
CCGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACT 
AATATTCTATCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGC 
ATATCAAACTTATTTTGAGGGTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 
TAAAAC CGGCTAATATGGGGTC ATCAGTAGGTATTTC AAAAGCGACAGATGAATC CTC ACT TCGCT CTGCAATTGAC TTA 
GCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAA 
TGATGTTAAGACAACTTTTCCTGGCGAAGTTGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATA 
AAATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAA 
GCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATAC 
AATGCCCGGTTTTACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAATATGGGGCTAACTTATAGTGATTTGATTG 

SEQ ID NO. 1602: SAG0767 FROM THE 090 GBS TYPE la STRAIN 

AAACCGGGCATTGTATTCAGTTCGTTTAAGAAGACTTGTCCATCTTTCGTCAAAAAGAAATCACAGCGTGATAAACCACA 
AGCCCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTGCTTCCATAGTTGCTTCATCAACTTTAGCTGGAATAT 
C C ATAGTAATTTTATTATC AATATATTTGGCGT C ATAGT CATAGAAAT CGACGTCT TTAACGACT TCGC CAGGAAAAGTT 
GTCTTAACATCATTATTGCCTAAAATACCTACTTCAATTTCACGAGCTGTCACGCCTTGTTCAATCAAAATACGGCTATC 
ATACTTGAGAGCTAAGTCAATksCAGAGCGAAGTGAGGATTCATCTGTCGCTTTTGAAATACCTACTGATGACCCCATAT 
TAGCCGGTTTTACAAAAATTGGGAAACTTAAAGTTTCTAAAGAGAGTTTAATCGCATGTTCCAAATCATCACCCTCAAAA 
TAAGTTTGATATGCAACCTGAGGTACACCTACTGTTGCAAGGACTTGTTTTGTTGTAATTTTATCCATAGCCACGCTTGA 
AGATAGAATATTAGTC C C AACATAAGGCATC CTTAAAACTTCTAAAAAT C CTTGGATAGAAC C AT CTTC C CC CATTGGTC 
CATGTAAAACGGGGAAAACAATTGCATTATCATCATAGATATCACTTGGACGAACCATTTTGTCTAAATCAACAGTTTGG 
TTTGTCATTAACTTTTCATCTGAAGATGGCATTTCATCAAATTCTTGTGTTTTAATAAATTGACCTACTTGCGTG 

SEQ ID NO. 1603: SAG0767 FROM THE COH1 TYPE la STRAIN 

TCGCTCTGCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTA 
AAAC TTATTT TATC ACG C AAGTAGGTCAATT T ATTAAAACAC AAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA 
ATGACAAAC C AAACTGTTGATTTAGAC AAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTC C C CGT 
TTTAC ATGGACC AATGGGGGAAGATGGTTCTATC CAAGGATT TTTAGAAGTTTTAAGGATGC CTTATGTTGGGACTAATA 
TTCTATCTTCAAGCGTGGCTAT 

SEQ ID NO. 1604: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

CGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAA 
CTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTT 
TTGACGAAAGATGGAC AAATCTT C TTAAACGAACTGAATAC AATGCCC 

SEQ ' ID NO. 1605: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AACGTGAAGTATCTGTACTGCTCTGCAGAAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATT 
TTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA 

SEQ ID NO. 1606: SAG0767 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

CTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAG 
TATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAA 
GACAACTT TT C C TGGCGAAGTCGTTAAAGACGTCGATT TCT ATGAC TATGACGC C AAATATATTGATAATAAAATTACTA 
TGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGG 
GCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGG 
TTTTACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAAT 

SEQ ID NO. 1607: SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

TTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTA 
GGCAATAATGATGTTAAGACAAC T TTTC CTGG CGAAGT CGTTAAAGACGTCGATTTCTATGACTATGACGC C AAAT ATAT 
TGAT AATAAAATTAC TATGGATATT C CAGCTAAAGT TGATGAAGC AAC TATGGAAGC AATGCGTC AATATGC AAGTAAAG 
CTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAA 
CTGAATACAATGCCCGGTTTTACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAAGTATGGGGCTAACCTT 

SEQ ID NO. 1608: SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN 

ATCTGTACTGTCTGCAGAAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAA 
GTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGA 
TTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCAATGGGGG 
AAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCTATCTTCAA 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ ID NO. 1609: SAG0767 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGTACCTCAGGTTGCATATCAAACTTATTTTGAGG 
GTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTGTAAAACCGGCTAATATGGGG 
T(^TCAGTAGGTATTTO^AAGCGACAGATGAATCCTCACTTCGCTCTGC^TTGACTTAGCTCTCAAGTATGATAGCCG 
TATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 
CTGGCGAAGTCGTTAAAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCA 
GCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTT 
ATCACGCTGTGATTTCTTTTTGACGAAAGAATGGACAAATCTTCTTAAACGAACTGAAATAC 

SEQ ID NO. 1610: SAG0767 FROM THE 2603V/R <3BS TYPE V STRAIN 

TCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGT 
AGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGATT 
TAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAAT 

SEQ ID NO. 1611: SAG0767 FROM THE H36b GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAG 
CTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAAT 
GATGTTAAGACAAC TTTTCCTGGCGAAGTCGTTAAAGACGTCGATT TC TATGACT ATGACGCC AAATATATTGATAATAA 
AATTACTATGGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAG 
CAATCGGGGCTTGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACA 
ATGCCCGGTTTTACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAATATGGGGCTAACTTATAG 

SEQ ID NO. 1612: SAG0767 FROM THE H36b TYPE lb STRAIN 

CGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTAT 
CACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAA 
CTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCA 
ATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCTATCTTCAAG 
CGTGGC TATGGATAAAATTAC AACAAAACAAGTC CTTGC AACAGTAG 

SEQ ID NO. 1613: SAG0767 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

ATGCGATTAAACTCTCTTTAGAACCTTTAAGTTTCCCAATTTTTGTAAACCCGGCTAATATGGGGTCATCAGTAGGTATT 
TCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACA 
AGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTCCTGGCGAAGTTGTTA 
AAGACGTCGATTTCTATGACTATGACGCCAAATATATTGATAATAAAATTACTATGGATATTCCAGCTAAAGTTGATGAA 
GCAACTATGGAAGCAATGCGTCAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
CTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGGTTTTACTCAGTGGTCAATGTATCCTC 
TGCTTTGGGAAAATATGGGGCTAACTT 

SEQ ID NO. 1614: SAG0767 FROM THE M732 GBS TYPE III STRAIN 

GTC ATGCCGTGC TATTAATTATGATAAATTTT TTGTTAAAACTTATTTTAT CACGCAAGTAGGT CAATTTATTAAAACAC 
AAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTGTTGATTTAGACAAAATGGTTCGTCCA 
AGTGATATCTATGATGATAATGCAATTGTTTTCCCCGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATT 
TTTAGAAGTTTTAAGGATGC C TTATGTTGGGACTAATATTCTATCTTCAAGCGTGGC TATGGATAAAATTAC AACAAAAC 
AAGTCCTTGCAACAGTAGGTGTACCTCAGG 

SEQ ID NO. 1615: SAG07 67 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

TTTTGAGGGTGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTGTAAAACCGGCTA 
ATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCACTTCGCTCTGCAATTGACTTAGCTCTCAAGTAT 
GATAGCCGTATTTTGATTGAACAAGGCGTGACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGAC 
AAC TTTTCC TGGCGAAGT CGTTAAAGACGTCGATTTCTATGACTATGACGC CAAATATATTGATAATAAAATTACTATGG 
ATAT TC CAGC TAAAGT TGATGAAGC AACTATGGAAGC AATGCGT CAATATGC AAGTAAAGCTTTTAAAGC AAT CGGGGCT 
TGTGGTTTATCACGCTGTGATTTCTTTTTGACGAAAGATGGACAAATCTTCTTAAACGAACTGAATACAATGCCCGGTTT 
TACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAATATGGGGCTAACTTATAGTGA 

SEQ ID NO. 1616: SAG0767 FROM THE A909 GBS TYPE la STRAIN 

TGGTCGCTCTGCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTATTAATTATGATAAATTTTTTG 
TTAAAACTTATTTTATCACGCAAGTAGGTCAATTTATTAAAACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAG 
TTAATGAC AAAC C AAAC TGTTGATTTAGACAAAATGGTTCGT C C AAGTGATATC TATGATGATAATGC AATTGTTTTCC C 
CGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTA 
ATATTC TAT CTT CAAGCGTGGCTATGGATAAAATTAC AAC AAAAC AAGTCC TTGCAAC AGTAGG 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ ID NO. 1617: SAG0767 PROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE 
COMPLEMENT) 

AAGCAGGGGATACATTGACCACTGAGTAAAACCGGGCATTGTATTCAGTTCGTTTAAGAAGATCTGTCCATCTTTCGTCA 
AAAAGAAATCACAGCGTGATAAACCACAAGCCCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTGCTTCCATA 
.GATGCTTCATCAACTTTAGCTGGAATATCCATAGCAATTTTATTATCAATATATTTGGCG 



SEQ1601 
SEQ16 02 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



GGTCGCTCTGTCGGAACGTGAAGTATCTGTACTGTCTGCAGAAAGCGTCATGCGTGCTA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



TAATTATGATAAATTT TTTGTTAAAACTTAT TTTATC ACGCAAGTAGGT C AATTTATTA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ160 6 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



AACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTAATGACAAACCAAACTG 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ16 04 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



TGATTTAGACAAAATGGTTCGT CC AAGTGATATCT ATGATGATAATGC AATTGTTTTC C 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ16X0 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



CGTTTTACATGGACCAATGGGGGAAGATGGTTCTATCCAAGGATTTTTAGAAGTTTTAA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQX606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



GATGC CTTATGTTGGGACTAATATT CTATCTTC AAGCGTGGCTATGGATAAAATTACAA 



• GGCTATGGATAAAATTACAA 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ16 0 9 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQX611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



AAAACAAGT C CTTGC AACAGTAGGTGT AC CT CAGGTTGC ATATCAAAC TTATTTTGAGG 

AAAAC AAGT C CTTGC AAC AGTAGGTGTACC TCAGGTTGCATAT CAAACT TATTTTGAGG 

r TTTTGAGG 

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

ATGCGATTAAACTCTCTTTAGAACCTTTAAGTTTCCCAATTTTTG 

TGATGATTTGGAACATGCGATTAAACTCTCTTTAGAAACTTTAAGTTTCCCAATTTTTG 

AAAACCGGC TAATATGGGGTCAT C AGTAGGTATTTCAAAAGCGAC AGATGAATC C TC AC 

AAACCGGGC 

-TCGCTCTGCGGAACGTGAAGTATCTGTACTG- TCTGC AGAAA - GCGT 

AACGTGAAGTATCTGTACTGCTCTGCAGAAAAGCGT 

CTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

ATCTGTACTG - TCTGCAGAAAAGCGT 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

TCTGTACTG - TCTGC AGAAA - GCGT 

AAAACCGGCTAATATGGGGTCATCAGTAGGTATTTCAAAAGCGACAGATGAATCCTCAC 

CGTGAAGT ATCTGTACTG- TCTGCAGAAA-GCGT 

AAAC C CGGCTAATATGGGGT C ATC AGTAGGTATTTC AAAAGCGACAGATGAATC CTC AC 

GT 

AAAAC CGGC TAATATGGGGTCAT C AGTAGGTATTTC AAAAGCGACAGATGAATC CTC AC 

TGGTCGCTCTGCGGAACGTGAAGTATCTGTACTG- TCTGCAGAAA- GCGT 

AAGCAGGGGATACATTGACCACTGAGTAAAACCGGGC 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 



TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 
TTGT - ATTCAGTTCGTTTAAGAAGACTTGTCCATCTTTCGTCAAAAAGAAATCACAGCG 
ATGC-GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG 

ATGC - GTGCTAT T AATTATGATAAAT TTTTTGTTAAAAC TTATT TT AT C ACGC AAGTAG 
TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

TTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

ATGC - GTGC TATTAAT TATGATAAATTTT T TGTTAAAAC TT ATTTTATC ACGCAAGTAG 
T CGCT CTGCAATTGAC TTAGC T C T C AAGTATGATAGC CGTAT TTTGATTGAAC AAGG CG 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQl 610 ATGC - GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG 

SEQ1611 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

SEQ1612 ATGC - GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG 

SEQ16 13 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

SEQ1614 ATGC CGTGCTATTAATTATGATAAATTTT TTGTTAAAAC TTATTTTATCACGCAAGTAG 

SEQ16 1 5 TCGCTCTGCAATTGACTTAGCTCTCAAGTATGATAGCCGTATTTTGATTGAACAAGGCG 

SEQ1616 ATGC - GTGCTATTAATTATGATAAATTTTTTGTTAAAACTTATTTTATCACGCAAGTAG 

SEQl 6 17 TTGT -ATTCAGTTCGTTTAAGAAGATCTGTCCATCTTTCGTCAAAAAGAAATCACAGCG 

SEQ1601 G AC AG C T C GTG AAATT G AAGT AGGTAT T TT AGG C AAT AAT G ATGT TAAG AC AAC T T T T C 

SEQ16 0 2 GATAAAC CACAAGC CCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTG 

SEQ1603 GTCAATTTATTAAA AC ACAAGAATTTGATGAAATGC C ATC TTCAGATGAAAAGTTA 

SEQ1604 

SEQ16 0 5 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAA 

SEQ1606 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQ1607 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQ16 0 8 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGC CATC TTCAGATGAAAAGTTA 

SEQl 6 0 9 , GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQ16 1 0 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA 

SEQ1611 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQl 612 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA 

SEQ1613 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQ1614 GTCAATTTATTAAA- - -ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA 

S EQ1 6 1 5 GACAGCTCGTGAAATTGAAGTAGGTATTTTAGGCAATAATGATGTTAAGACAACTTTTC 

SEQl 616 GTCAATTTATTAAA ACACAAGAATTTGATGAAATGCCATCTTCAGATGAAAAGTTA 

SEQl 6 17 GATAAAC CACAAGC CCCGATTGCTTTAAAAGCTTTACTTGCATATTGACGCATTG 

SEQ1601 TGGCGAAGTTGTTAAAGACGTCGATTTCTATGA- - CTATGACGCCAAAT - ATATTGATA 

SEQ16 0 2 TTCCATAGTT GCTT C AT CAACTTTAGCTGGAATATCCATAGTAATTTTATTATCA 

SEQ1603 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG 

SEQ1604 CGTCGATTTCTATGA- -CTATGACGCCAAAT -ATATTGATA 

SEQ1605 

SEQ16 0 6 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA- - CTATGACGCCAAAT - ATATTGATA 

SEQ1607 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA- - CTATGACGCCAAAT - ATATTGATA 

SEQ1608 TGACAAACC AAAC T GTT G AT T T AGAC AAAAT GGT T C GT C C AAGT GAT AT C TAT G 

SEQ1609 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA- - CTATGACGCCAAAT- ATATTGATA 

SEQl 6 1 0 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG 

SEQ1611 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA- - CTATGACGCCAAAT -ATATTGATA 

SEQ16 12 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG 

SEQ1613 TGGCGAAGTTGTTAAAGACGT CGATT TC TATGA - - CTATGACGCCAAAT - ATATTGATA 

SEQ16 14 TGACAAACC AAACTGTTGATTTAGACAAAATGGTTCGTCCAAGTGATATCTATG 

SEQ16 15 TGGCGAAGTCGTTAAAGACGTCGATTTCTATGA - - CTATGACGCCAAAT- ATATTGATA 

SEQl 6 16 TGACAAACC AAAC TGTTGATTTAGAC AAAATGGTTCGTC CAAGTGATAT CTATG 

SEQ16 17 TTCCATAGAT GCTTCATCAACTTTAGCTGGAATATCCATAGCAATTTTATTATCA 

SEQ1601 T AAAAT T AC TAT - -GGATATTCCAGCTAAAGTTGATGAAGCAACTATGGAAGCAATGCG 

SEQ1602 TATATTTGGCGTCATAGTCATAGAAATCGACGTCTTTAACGACTTCGCCAGG--AAAAG 

SEQ16 03 TGATAATGCAAT - - TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG - - ATGGT 

SEQl 6 0 4 T AAAAT TACTAT - - GGATATT CC AGC TAAAGTTGATGAAGC AAC TATGGAAGC AATG CG 

SEQ1605 

SEQl 606 TAAAATTACTAT - - GGATAT TC C AGC TAAAGTTGATGAAGCAACT ATGGAAGCAATGCG 

SEQl 6 0 7 TAAAATTACTAT - - GGATAT TC C AG CTAAAGTTGATGAAGCAACT ATGGAAGCAATGCG 

SEQ1608 TGATAATGCAAT- -TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG- -ATGGT 

SEQ160 9 TAAAATTACTAT- - GGATATTCCAGCTAAAGTTGATGAAGCAAC TATGGAAGC AATGCG 

SEQ1610 TGATAAT 

SEQ1611 TAAAATTACTAT - - GGAT ATTCC AGC TAAAGTTGATGAAGC AACTATGGAAGCAATGCG 

SEQ1612 TGATAATGCAAT- -TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG- -ATGGT 

SEQ1613 TAAAATTACTAT - - GGATATT CC AGC TAAAGTTGATGAAGC AACTATGGAAGCAATGCG 

SEQ1614 TGATAATGCAAT- -TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG- - ATGGT 

SEQ1615 TAAAATTACTAT- -GGATATTCCAGCTAAAGTTGATGAAGC AACTATGGAAGCAATGCG 

SEQ1616 TGATAATGCAAT- -TGTTTTCCCCGTTTTAC ATGGACC AATGGGGGAAG- -ATGGT 

SEQ1617 TATATTTGGCGT ABLECMPARATIVESEQENCESRELA-TINGTSAGD- -ALANI 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ1601 
SEQ1602 
SEQ16Q3 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ161Q 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
T-- GTCTTAACATCATTATTGCCTAAAATACCTACTTCAATTTCACGAGCTGTCACGCC 
C- - TATCCAAG - -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 

CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C - - TATCCAAG - - GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 

CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- - TATCCAAG- -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- -TATCCAAG- -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAATATTCT 
CAATATGCAAGTAAAGCTTTTAAAGCAATCGGGGCTTGTGGTTTATCACGCTGTGATTT 
C- - TATCCAAG - -GATTTTTAGAAGTTTTAAGGATGCCTTATGTTGGGACTAAT ATT CT 
E — DALANINELIGASE 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TGTTCAATCAAAATACGGCTATCATACTTGAGAGCTAAGTCAATKSCAGAGCGAAGTGA 

TCTTCAAGCGTGGCTAT 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCC 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TTTTTGACGAAAGA- -TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 

TCTTCAA 

TTTTTGACGAAAGAA - TGGACAAATCTTCTTAAACGAACTGAAATAC 

TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 

TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAG 

TTTTTGACGAAAGA - -TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGGTGT 
TTTTTGACGAAAGA- - TGGACAAATCTTCTTAAACGAACTGAA- TACAATGCCCGGTTT 
TCTTCAAGCGTGGCTATGGATAAAATTACAACAAAACAAGTCCTTGCAACAGTAGG 

ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA - TATGGGGC TAACTTATAGTGATT 
GATTCATCTGTCGCTTTTGAAATACCTACTGATGACCCCATATTAGCCGGTTTTACAAA 

ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA- T 

ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAAGTATGGGGCTAACCTT 

ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAA-TATGGGGCTAACTTATAG 

ACTCAGTGGTCAATGTATCCTCTGCTTTGGGAAAA-TATGGGGCTAACTT 

CCTCAGG 

ACTCAGTGGTCAATGTATCCCCTGCTTTGGGAAAA- TATGGGGCTAACTTATAGTGA- - 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ16 04 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ16 0 9 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 

SEQ1601 
SEQ1602 
SEQ16 03 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ16X0 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



GATTG 

ATTGGGAAACTTAAAGTTTCTAAAGAGAGTTTAATCGCATGTTCCAAATCATCACCCTC 

AAATAAGTTTGATATGCAACCTGAGGTACACCTACTGTTGCAAGGACTTGTTTTGTTGT 

_ * _ „ _ — _. _ „ 

ATTTTATCCATAGCCACGCTTGAAGATAGAATATTAGTCCCAACATAAGGCATCCTTAA 
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Table 16: Comparative Sequences relating to SAG0767 (D-alanine - D-alanine ligase) 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ160 9 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



ACTTCTAAAAAT CC TTGGATAGAAC CAT CTTC C C C CATTGGTC CATGTAAAACGGGGAA 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ161X 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



ACAATTGCATTATCATCATAGATATCACTTGGACGAACCATTTTGTCTAAATCAACAGT 



SEQ1601 
SEQ1602 
SEQ1603 
SEQ1604 
SEQ1605 
SEQ1606 
SEQ1607 
SEQ1608 
SEQ1609 
SEQ1610 
SEQ1611 
SEQ1612 
SEQ1613 
SEQ1614 
SEQ1615 
SEQ1616 
SEQ1617 



TGGTTTGTCATTAACTTTTCATCTGAAGATGGCATTTCATCAAATTCTTGTGTTTTAAT 
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Table 16: Comparative Sequences relating to SAG0767 (D~alanine - D-alanine ligase) 



SEQ1601 

SEQ160 2 AATTGACCTACTTGCGTG 

SEQ1603 

SEQ1604 

SEQ1605 

SEQ1606 

SEQ1607 

SEQ1608 

SEQ1609 

SEQ1610 

SEQ1611 

SEQ1612 

SEQ1613 

SEQ1614 

SEQ1615 

SEQ1616 

SEQ1617 
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Table 17: Comparative Sequences relating to SAG1086 (xanthine phophoribosyltransf erase) 



SEQ ID NO. 1701: SAG1086 FROM THE1169NT1 GBS NONTYPEABLE STRAIN 

TTTAAAGGTTGATTCCTTTTTGACTCATCAGGTAGATTTTGAGTTAATGCA^ 
CCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCGCCAGCAGTGTACG^ 
GCTAAAAACKSCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGA^ 
TATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACT 

AAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTC 
ACAGGTGTTCCAGT 

SEQ ID NO. 1702s SAG0767 FROM THE 18RS21 GBS TYPE II STRAIN 

TTTAGGTGAGAACATTTTAAAGGTTGATTCTTTTTTGACTCATCAGGTAGATTTTG^ 

ATA^TATAAAGAAGCCGGCATTAC 

GkACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTO 
TACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCA 
CTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTAT^ 
GATT TGTTAGAAAAAACA 

SEQ ID NO. 1703s SAG0767 FROM THE H36bl GBS TYPE lb STRAIN 

AAGAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAAC^TTTTAAAAGTTGA 
ATGCAGGAAATAGGTAAAGTTTTTGCTGATAAAT^ 
AGCAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATT^ 
CTGAAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTG 

GATGACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGA^TTATTGGTCAAGCTGGAGCTAAGGT 
TATTGAAAAATCTTTCCAAGATGGGCGTGATT 

SEQ ID NO- 1704: SAG0767 FROM THE M732 GBS TYPE III STRAIN 

ATTCTTTTTTGACTATCAGGTAAATTTTGAGTTAATGCAGGAAATAG 

AGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAGCAGTGTACGCAGCTCAAGCATTGGGCGTA 

AAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGC^ 

CTTTTTATCTAACGATGATACTGTACT(^TCATTGATGACTTTTTAGCAAACGGTC^ 

AAGCTGAAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTGTTAGAAAAAACAGGTGTTCCG 
GTTACTTCTCTTGCTCGT 

SEQ ID NO. 1705: SAG0767 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAAC^TTTTAAA&GT 

GCAGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCC^ 

CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAA 

GAAGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATC^ 
TGACTTTTTAACAAACGGTCAAGC 

SEQ ID NO- 1706: SAG0767 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

ACATTTTAAAGGTTGATTCTTTTTTGACTCATCAGGTAGATTTTGAGTTAATGC^ 
GAkGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGC^ 
ATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTT^ 
TTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 

CTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGG 
AAA 

SEQ ID NO. 1707: SAG0767 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

ACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTC^ 
AGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCCGGCAT 
- GTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTAT 
AGTGTATTCTTTTACAAAGCAAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTT^ 
ACTTTTTAGCAAACGGKCAAGCGGSTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTA 

SEQ ID NO. 1708: SAG0767 FROM THE COH1 GBS TYPE la STRAIN 

TTTAAAAGTTGATTCTTTTTTGACTCATCA 

CCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAGC^^ 
GCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTG^ 

TATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACGGTCAAGCGGCT 

AAATTATTGGTCAAGCTGAAGCTAAGK5TTGCTGGTATCGGAATCGTTATTGA 

ACAGGTGTTCCGGTTAC 

SEQ ID NO. 1709: SAG0767 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

GCTGATAAATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATT 

GGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACA 
AAGTTACGAGTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATT 

GCGGCTAAAGGATTACTTGAAATTTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAG^ 
GGCGTGATTTGTTAGAAAAAACAGGTGTTCCAGT 
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Table 17: Comparative Sequences relating to SAG1086 (xanthine phophoribosyltransf erase) 



SEQ ID NO. 1710: SAG0767 PROM THE 2603 V/R.GBS TYPE V STRAIN 

AACGTATTCTTAAAGATGX3TGATGTTTTAGK3TGAGAACATTTTAAAAG 

CAGGAAATAGGTAAAGTTTTTGCTGATAAATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTG 
AGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAA^ 

AAGTGTATTCTTTTACAAAGC^GTTACGAGTC^GTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACT(^TCATTGAT 

GACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATTATTGGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGAA 

TGAAAAATCTTTCCAAGATGGGCGTGATTTGTTAGAAAAAACAGGTGTTCCAG 

SEQ ID NO. 1711: SAG0767 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE 
COMPLEMENT) 

ACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAGCAGT 

AGCTAAGAACATTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAA^ 

GTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATGACTTTTTAGCAAACGGTCA 

GGTCAAGCTGGAGCTAAGGTTGCTGGTATCGGA 



SEQ17 01 TTTAAAGGTTGATTCCT 

SEQ17 02 TTTAGGTGAGAACATTTTAAAGGTTGATTCTT 

SEQ17 03 AGAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT 

SEQ1704 ATTCT 

SEQ1705 -GAACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT 

SEQ17 06 - ■ ACATTTTAAAGGTTGATTCTT 

SEQ1707 ACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT 

SEQ17 08 TTTAAAAGTTGATTCTT 

SEQ1709 

SEQ17 10 - - AACGTATTCTTAAAGATGGTGATGTTTTAGGTGAGAACATTTTAAAAGTTGATTCTT 

SEQ1711 

SEQ17 0 1 TTTGAC TC ATCAGGTAGATT TTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 0 2 TTTGACT CATCAGGT AGATT TTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 0 3 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 0 4 T T T TGAC T AT C AGGT AAAT T T T GAGT T AATG C AGG AAATAGGT AAAGT T T T TG C TGATA 

SEQ1705 TTTGAC T C ATCAGGTAAATTTTGAGT TAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 0 6 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

• SEQ1707 TTTGACTCATCAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 0 8 TTTGACTCATCAGGTAAATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ17 09 GCTGATA 

SEQ17 1 0 TTTGACT CAT CAGGTAGATTTTGAGTTAATGCAGGAAATAGGTAAAGTTTTTGCTGATA 

SEQ1711 

SEQ17 0 1 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCGCCAG 

SEQ17 0 2 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCACCAG 

SEQ17 0 3 AT ATAAAGAAGCCGGC AT TACGAAGGT TGTTAC AATTGAAGCAT CTGGAATTGCGCCAG 

SEQ1704 ATATAAAGAAGC CGGC ATTACGAAGGT TGTTACAATTGAAGC AT CTGGAATTGCGCCAG 

SEQ17 0 5 AT ATAAAGAAGC CGGC AT TACGAAGGTTGTTACAATTGAAGC AT CTGGAATTGCGCCAG 

SEQ17 0 6 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACGATTGAAGCATCTGGAATTGCACCAG 

SEQ1707 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG 

SEQ17 0 8 AT ATAAAGAAGCCGGC ATTACGAAGGTTGTTACAATTGAAGC AT CTGGAAT TGCGC C AG 

SEQ17 0 9 AT ATAAAGAAGCCGGC AT TACGAAGGTTGTTACAATTGAAGC AT CTGGAATTGCGCCAG 

SEQ1710 ATATAAAGAAGCCGGCATTACGAAGGTTGTTACAATTGAAGCATCTGGAATTGCGCCAG 

SEQ1711 ACGAAGGTTGTTACAATTGAAGC AT CTGGAATTGCGC C AG 

SEQ1701 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAGGCTAAGAACA 

SEQ17 0 2 C AGTGT AC G C AG C T C AAG CAT T GGG C G KAC C AAT GAT AT T T G C T AAAAAAG C T AAG AAC A 

SEQ17 0 3 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ1704 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ170 5 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ17 0 6 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ1707 CAGTGTACGCAGCTCAAGCATTGGGCGTACGAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ 17 0 8 C AGT G T A C G C AG C T C AAG CAT TGG G C GT A C C AAT GAT AT TT GC T AAAAAAG C T AAG AAC A 

SEQ17 0 9 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ17 1 0 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 

SEQ1711 CAGTGTACGCAGCTCAAGCATTGGGCGTACCAATGATATTTGCTAAAAAAGCTAAGAACA 
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Table 17: Comparative Sequences relating to SAG1086 (xanthine phophoribosyltransferase) 



SEQ1701 
SEQ1702 
SEQ1703 
SEQ1704 
SEQ1705 
SEQ1706 
SEQ1707 
SEQ1708 
SEQ1709 
SEQ1710 
SEQ1711 

SEQ1701 
SEQ1702 
SEQ17 03 
SEQ17 04 
SEQ1705 
SEQ170 6 
SEQ1707 
SEQ1708 
SEQ1709 
SEQX710 
SEQ1711 

SEQ1701 
SEQ1702 
SEQ1703 
SEQ1704 
SEQ1705 
SEQ170 6 
SEQ1707 
SEQ1708 
SEQ1709 
SEQ1710 
SEQ1711 

SEQ1701 
SEQ1702 
SEQ1703 
SEQ1704 
SEQ1705 
SEQ1706 
SEQ1707 
SEQ1708 
SEQ1709 
SEQ1710 
SEQ1711 

SEQ17 01 
SEQ1702 
SEQ1703 
SEQ1704 
SEQ1705 
SEQ1706 
SEQ1707 
SEQ17 08 
SEQ1709 
SEQ1710 
SEQ1711 



TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGWTACGA 
TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATRTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATCTTAACTGCTGAAGTGTATTCTTTTACAAAGCAAGTTACGA 
TTAC TATGACTGAAGGTATC T TAAC TGCTGAAGTGT ATTC TTTTACAAAGCAAGTTACGA 
TTACTATGAC TGAAGGTATC T TAAC TG CTGAAGTGTATTC TTTTACAAAGCAAGTTACGA 
TTACTATGAC TGAAGGTATCT TAACTGCTGAAGTGTATTC T TTTACAAAGCAAGTTACGA 
TTACTATGACTGAAGGTATC T TAAC TGCTGAAGTGTATTC TTTTACAAAGCAAGTTACGA 

GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 
GTCAAGTTTCTATTGTGAGTCGCTTTTTATCTAACGATGATACTGTACTCATCATTGATG 

ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA 
ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT - ATTGGTCAAGCTGGA 
ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT-ATTGGTCAAGCTGGA 
ACTTTTTAGCAAACGGTC AAGCGGC TAAAGGATTACTTGAAATT - ATTGGTCAAGCTGAA 

ACTTTTTAACAAACGGTCAAGC 

ACTTTTTAGCAAACMGTCYAGCGGC TAAAGGATTACTTGAAATT - ATTGGT CAAGCTGGA 
ACTTTTTAGCAAACGGKCAAGCGGS TAAAGGATTACTTGAAATT - ATTGGTCAAGCTGGA 
ACTTTTTAGGAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT- ATTGGTCAAGCTGAA 
ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATTTATTGGTCAAGCTGGA 
ACTTTTTAGCAAACGGTCAAGCGGCTAAAGGATTACTTGAAATT - ATTGGTCAAGCTGGA 
ACTTTTTAGC AAACGGTCAAGCGGCTAAAGGATTACTTGAAATT - ATTGGTCAAGCTGGA 

CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG 
CTAAGGTTGC TGGTAT CGGAATCGTTATTGAAAAATC T TT C CAAGATGGGCGTGATTTG 
CTAAGGTTGCTGGTATCGGAATC YTTATTGAAAAATCTTTCCAAGATGGGCGTGATT - - 
CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG 

CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG 

CTA 

CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTC CAAGATGGGCGTGATTTG 
CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG 
CTAAGGTTGCTGGTATCGGAATCGTTATTGAAAAATCTTTCCAAGATGGGCGTGATTTG 
CTAAGGTTGC TGGTATCGGA TAB CMARAT VS TN C S R - - ATNGTS AGXANTHN 

TAGAAAAAACAGGTGTTCCAGT 

TAGAAAAAACA 

TAGAAAAAACAGGTGTTCCGGTTACTTCTCTTGCTCGT 
< TAGAAAA 

TAGAAAAAACAGGTGTTCCGGTTAC 

TAGAAAAAACAGGTGTTCCAGT 

TAGAAAAAACAGGTGTT CC AG 

HRBSYTRANSRAS 
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Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ ID NO. 1801s SAG1600 PROM THE H36b GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

AATCTTCATTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGACCT^ 

TATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGCCTGGCAAGAAATTAAAGAAAAACTAGAC 

CCTGTTTTAGGCGTTATTTTACCAGGAGCTAGCGCAGCTATCAAATCAACTAATTCAGGGA 

TGTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTC 

TTGTGGAATCAAATCAGATGTCTTCTAGTTTAGCCAAAAAGGTGGTTTATGAAACGTTGTCCCCATTAGTTGGTAAATTAGATACTTTA 

ATTTTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGAGGTTAAATTAATT^ 

AACCGTTCGTGATATTTCTGTTTTATTGAACTATTTTGAGATAAACCATAATTGGCAAAATAAACACGGTGGTC^ 

CCGCCAGCCCAA 

SEQ ID NO. 1802: SAG1600 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

AAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCATTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGACC 

AGAGAGTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGATGATTGTT^^ 

CTGGCAAGAAATTAAAGAAAAACT AGACAT CC CTGT T T TAGGCGT TATT TT ACCAGGAGCTAGCGCAGCTATCAAATCAACTAATTTAG 

GGAAAGTTGGTATTATAGGTACTCCCATGACTGTTAAATCAGATGCTTATCGTCAAAAAA 

GTATCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAG^TGTCTTCTAGTTTAGCCAA 

GTCCCCATTAGTTGGTAAATTAGATACTTTAATTTTAGGTTGC^^ 

CTGAGGTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTAT 
AATAAACACGGTGGTC^TCACTTTTAC^CMCCGCC^GCCCAAAAGGTTTTAAAGAAA 

SEQ ID NO. 1803: SAG1600 FROM THE 090 GBS TYPE la STRAIN 

AATCTTCATTGGAGACCAGGCTAGAGCTCCGTATGG 

ATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCA^ 

CTGTTTTAGGCGTTATTTTACCAGGAGCTAGCGCAGCTATCAAATCA^ 

GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGTATCCCTTGCTTGTCCGAAAT 

TGTGGAATCAAATCAGATGTCTTCTAGTTTAGCCAAAAAGGTGGTTTATGAAACGCTGTCCC 

TTTTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTT^^ 

ACCGTTCGTGATATTTCTGTTTTATTGAACTATTTTGAGATaAmCCATaATTGGsmAAATAAACACGGTGGTGATCACTTTTACA^ 
CGsCAGCCCAAAAGGTTTTTAAGGAAATTGCAGAACAATGGCTTAATCAAGAAATAAAT 



SEQ ID NO. 1804: SAG1600 FROM THE A909 GBS TYPE la STRAIN 

GCGGTTGTGTAAAAGTGATGACCACCGTGTTTATTTTGCCAATTA^ 
GGTTTCTGCGCCACTATCAATTAATTTAACCTCAGCC^ 

AAATTAAAGTATCTAATTTACC^CTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAAACTAG 

ACAAT TGGAA CAAAT T T CGGACAAGCAAGGGATAC CACAG CAGT ATT TGG AGACAAAGCT TGAATT T T TT GACGATAAGCAT CT GATTT 
AACAGTCATGGGAGTACCTATAATACCAACTTTCCCTAAATTAGTTGATTTGATAGCTGCGCTAGCTCCTGGTAAAATAACGCCTAAAA 

CAGGGATGTCTAGTTTTTCTTTAATTTCTTGCCAGGCAACTGCAGTTGCTGTATTACAAGCT^ 

AATAAGAAGTTAACCATCTGCCAGGTAAACTCTCTAATCTGTTGAGCAGGTCTAGGACCATACGGAGCTCTAGCCTGATCTCCAAT 
GATTACTTCCTCTTCTGGAAGTTGACGGAACATTTCCTTAACAACCGTTAAACCACCT 

SEQ ID NO. 1805: SAG1600 FROM THE COHl GBS TYPE la STRAIN 

TTCCGTCAACTTCCAAAATATGAAGTAATCTTCATTG 

GTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGA^ 

AAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTT^ 

GTTGGTATTATAGGTACTCC(^TGACTGTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGTATC 
CCTTGCTTGTCCGAAAT 

SEQ ID NO. 1806: SAG1600 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GTAATCTTC^TTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAATTT 
CTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGCCTGGCAAGAAATTAAAGAAAAAC 

TAC 

SEQ ID NO. 1807: SAG1600 FROM THE 11S9NT1 GBS TYPE V STRAIN 

CTTTTGGGCTGGCGGTTGTGTAAAATTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATCTCAAAATAGTTCAATAAAACAGAA 

ATATCACGAACGGTTTCTGCGCCACTATCAATTAATTTAACCTCAGCCCCCATAACATTTTGA^ 

CGTGCAACCTAAAATTAAAGTATCTAATTTACCAACTAATGGGGACAATGTTT 

GATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATACC^ 

G CAT CTG AT T TAA CAGT CAT GGGAGTACCT ATAA 

SEQ ID NO. 1808: SAG1600 FROM THE 1169NT1 GBS TYPE V STRAIN 

GTAATCTTCATTGGGGATCAGGCTAGAGCTCCGTATGGTCCTAGACCTGCT 
CTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTT 

SEQ ID NO. 1809: SAG1600 FROM THE 18RS21 GBS TYPE II STRAIN 

GAAATGTTCCGTCAACTTCCAGAAGAGGAAGTAATCTTCATTGGAGATC 

TAGAGAGTTTACCTGGCAGATGGTTAACTTCTTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTG 
CCTGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTA 

GGGAAAGTTGGTATTATAGGTACTCCC2ATGACTGTTAAATCAGATGCTTATCGTCAAAAAATTCAAGC 
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Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ ID NO. 1810: SAG1600 FROM THE 18RS21 TYPE II STRAIN 

ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATATTGATGACCACC^ 

CAATAAAACAGAAATATCACGAACGGTTTCTGCGCCACTATC^AATTA^ 

ATATGGGATAATGCGTGCAACCTAAAATTAAAGTA 

SEQ ID NO. 1811: SAG1600 FROM THE 2603 V/R GBS TYPE V STRAIN 

ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATAAGTGATGACCACCGTGTTTATTTTGCCAATTATGGTTTATCTC^AAATAGT 
TCAATAAAAGAGAAATATCACGAACGGTTTCTGCGC 

AATAGGGGATAATGCGTGCAACCTAAAATTAAAGTATCTAATTTACCAACTAA 
ACTAGAAGACATCTGATTTGATTCCACAATTGGAACAA 

SEQ ID NO. 1812: SAG1600 FROM THE M7 81 GBS TYPE III STRAIN 

GGCGGTTGTGTAAAAGTGATGACCACCGTGTTTATTTTGCCAATT^ 
CGGTTTCTGCGCCACTATCAATTAATTTAACCT 

AAAATTAAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAAACTAGAAGA 
SEQ ID NO. 1813: SAG1600 FROM THE M 781 GBS TYPE III STRAIN 

AATCTTCATTGGAGATCAGGCTAGAGCTCCGTATGGTCCTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGA 
TATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGC 

SEQ ID NO. 1814: SAG1600 FROM THE JM9130013 GS TYPE VIII STRAIN 

TGGGCTGGCGGTTGTGTAAAAGTGATGACCACCGTGTTTATTTTGCC^ 

CACGAACGGTTTCTGCGCCACTATCAATTAATTTAACCTC^GCCCCCATAACATTTTG^ 

CAACCTAAAATTAAAGTATCTAATTTACGAACTAATGGGGACA^ 

TGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATACCA 

CTGATTTAACAGTCATGGGAGTACCTATAATACCAACTTTCCCTGAA 



SEQ1801 AATCTTCATTGGAGATCAGGCTAGAGCT 

SEQ1 802 AAATGTT C CGTC AACTTC CAGAAGAGGAAGTAATCTTCATTGGAGATCAGGCTAGAGC T 

SEQ1803 AAT C T T C AT TGGAG AC C AGG C T AGAG C T 

SEQ1804 GCGGTTGTGTAAAAG-T 

SEQ 1805 TTC CGTCAAC TTC CAAAATATGAAGTAATCTTCAT TGGAGATC AGGCTAGAGCT 

SEQ1 806 GTAATC TTCAT TGGAGATC AGGCTAGAGCT 

SEQ1 807 CTTTTGGGCTGGCGGTTGTGTAAAAT- T 

SEQ1 808 GTAATCTTCATTGGGGATCAGGCTAGAGCT 

SEQ180 9 AAATGTT CCGT C AACTT C C AGAAGAGGAAGTAATCTTCATTGGAGATC AGGC TAGAGC T 

SEQ1810 - AT TT C T T T AAAA C C T T T TGGG C TGG CGGTTGTGT AAT AT - T 

SEQ1811 ATTTCTTTAAAACCTTTTGGGCTGGCGGTTGTGTAATAAGT 

SEQ1812 GG CGGTTGTGT AAAAG-T 

SEQ18 13 AATCTTCATTGGAGATCAGGCTAGAGCT 

SEQ1814 TGGG C TGG CGGTTGTGT AAAAG-T 

SEQ18 0 1 CGTATGGTC - CTAGAC CTGCT C AAC AGATTAGAGAGTTTAC CTGGC AGATGGTTAATTT 

SEQ18 0 2 CGTATGGTC - CTAGAC C TGCT C AAC AGATTAGAGAGTTTACC TGGC AGATGGTTAACT T 

SEQ1 803 CGTATGGTC - CTAGAC CTGCTCAACAGATTAGAGAGTT- AC CTGGC AGATGGTTAATTT 

SEQ1804 - - GATGACCACCGTGTTTATTTTGCCAATTATGG - - TTTATCTCA - AAATAGTTCA 

SEQ18 0 5 CGTATGGTC - CTAGACCTGCTCAACAGATTAGAGAGTTTACCTGGCAGATGGTTAACTT 

SEQ18 0 6 CGTATGGTC - C TAGAC CTGCTC AAC AGATTAGAGAGTTTAC CTGGCAGATGGTTAATTT 

SEQ1807 - - GATGACCACCGTGTTTATTTTGCCAATTATGG - -TTTATCTCA -AAATAGTTCA 

SEQ 1 8 0 8 CGTATGGTC - CTAGAC C TGCTC AAC AGATTAGAGAGTT TAC CTGGC AGATGGTTAATTT 

SEQ1809 CGTATGGTC - CTAGAC C TGCT C AAC AGATTAGAGAGTTTAC CTGGC AGATGGTTAAC TT 

SEQ1810 - - GATGACCACCGTGTTTATTTTGCCAATTATGG- - TTTATCTCA- AAATAGTTCA 

SEQ1811 - -GATGACCACCGTGTTTATTTTGCCAATTATGG- - TTTATCTCA- AAATAGTTCA 

SEQ1812 - -GATGACCACCGTGTTTATTTTGCCAATTATGG- -TTTATCTCA- AAATAGTTCA 

SEQ1813 CGTATGGTC - CTAGAC CTGCTC AAC AGATTAGAGAGTTTAC CTGGC AGATGGTTAACTT 

SEQ1 8 14 - - GATGACCACCGTGTTTATTTTGCCAATTATGG- - TTTATCTCA- AAATAGTTCA 
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Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC 
TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC 
TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC 

- - ATAAAACAGAAATATCACGAACGGT- TTCTGCGCCACTATCAATTAATTTAACCTCA 
TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC 
T TATTGACTAAAAATGTTAAGATGATTGTT ATAGC TTGTAATACAGCAAC TGC AGTTGC 

- - ATAAAACAGAAATATCACGAACGGT -TTCTGCGCCACTATCAATTAATTTAACCTCA 
TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTT - - 
TTATTGACTAAAAATGTTAAGATGATTGTTATAGCTTGTAATACAGCAACTGCAGTTGC 

- - ATAAAACAGAAATATCACGAACGGT - TTCTGCGCCACTATCAATTAATTTAACCTCA 

- - ATAAAACAGAAATATCACGAACGGT - TTCTGCGCCACTATCAATTAATTTAACCTCA 

- - ATAAAACAGAAATATCACGAACGGT - TTCTGCGCCACTATCAATTAATTTAACCTCA 
TTATTGAC TAAAAATGTTAAGATGATTGTTATAGC TTGTAATACAGCAACTGC 

- - ATAAAACAGAAATATCACGAACGGT - TTCTGCGCCACTATCAATTAATTTAACCTCA 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



TGGCAAGAAATTAAAGAAAAACTAGACGTGCCTGTTTTAGGCGTTATTTTACCAGGAGC 
TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC 
TGGCAAGAAATTAAAGAAAAACTAGACATACCTGTTTTAGGCGTTATTTTACCAGGAGC 
CCCCCATAACATTTTGAATGATGGGACGTAATAGGGGATAATGC - GTGCAACCTAAAAT 
TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC 

TGGC AAGAAATTAAAGAAAAAC TAGACATAC 

C CCC C ATAACATTTTGAATAATGGGACGTAATAGGGGATAATGC - GTGCAACCTAAAAT 

TGGCAAGAAATTAAAGAAAAACTAGACATCCCTGTTTTAGGCGTTATTTTACCAGGAGC 
C C CC CATAAC ATTTTGAATGATGGGACGTAATATGGGATAATGC - GTGCAACCTAAAAT 
CC C C C ATAAC ATTTTGAATGATGGGACGTAATAGGGGATAATGC - GTGCAACCTAAAAT 
CCCCCATAACATTTTGAATGATGGGACGTAATAGGGGATAATGC - GTGCAACCTAAAAT 

C CC C C ATAACATTTTGAATGATGGGACGT AATAAGGGATAATGC - GTGCAACCTAAAAT 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 

SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ18 0 8 
SEQ1809 
SEQX810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



AG C G C AG C TAT C AAAT C AAC TAATT C AGGG AAAGT TGGT AT T AT AGGT AC T C C C ATG AC 
AGCGC AGC TATCAAATCAAC TAAT TTAGGGAAAGTTGGTATTATAGGTAC TCCCATGAC 
AGCGC AGC TATCAAATCAAC TAATT CAGGGAAAGTTGGTATTATAGGT AC TC CCATGAC 
AAAGT AT C TAATTTACCAAC TAATGGGGACAACGTTTC ATAAACCACCT TTTTGGC TAA 
AGCGCAGCTATCAAATCAACTAATTTAGGGAAAGTTGGTATTATAGGTACTCCCATGAC 

AAAGT ATCTAATTTACCAACTAATGGGGACAATGTTTC AT AAACCACCTTTTTGGCTAA 

AG C GC AG C TATC AAAT C AAC TAATT TAG GG AAAGT TGGT AT TAT AGGT AC T C C CAT G AC 

AAAGTA 

AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA 
AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA 

AAAGTATCTAATTTACCAACTAATGGGGACAACGTTTCATAAACCACCTTTTTGGCTAA 

GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT 
GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT 
GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT 
CTAGAAGACATCTGATTTGATTCCACAATTGGAACAAATTTCGGACAAGCAAGGGATAC 
GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGCTTTGTCTCCAAATACTGCTGTGGT 

C TAGAAGAC ATCTGATTTGATT CC AC AATTGGAAC AAATTT CGGAC AAG C AAGGGATAC 

GTTAAATCAGATGCTTATCGTCAAAAAATTCAAGC 

CTAGAAGACATCTGATTTGATTCCACAATTGGAACAA 

CTAGAAGA 

C TAGAAGACAT C TGATT TGATT C CAC AATTGGAACAAAT TTCGGAC AAGCAAGGGATAC 
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Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT 
TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT 
TCCCTTGCTTGTCCGAAATTTGTTCCAATTGTGGAATCAAATCAGATGTCTTCTAGTTT 
ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT 
TCCCTTGCTTGTCCGAAAT- 

ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT 
ACAGCAGTATTTGGAGACAAAGCTTGAATTTTTTGACGATAAGCATCTGATTTAACAGT 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



GCCAAAAAGGTGGTTTATGAAACGTTGTCCCCATTAGTTGGTAAATTAGATACTTTAAT 
GC C AAAAAGGTGGTTTATGAAACGTTGTCC C CATTAGTTGGTAAATTAGATAC TTTAAT 
GCCAAAAAGGTGGTTTATGAAACGCTGTCCCCATTAGTTGGTAAATTAGATACTTTAAT 
ATGGGAGTACCTATAATACCAACTTTCCCTAAATTAGTTGATTTGATAGCTGCGCTAGC 



ATGGGAGTACCTATAA- 



ATGGGAGTAC CTATAATACCAACTTTCC CTGAATAB CMARATVS TNC S RATNGTS AGGT 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



TTAGGTTGCACGC ATTATC C CTTATTACGTCC CATCATTCAAAATGTTATGGGGGCTGA 
TTAGGTTGCACGC ATTATC C C C TATTACGTCC CATCATT CAAAATGTTATGGGGGCTGA 
TTAGGTTGCACGCATTATCCCTTATTACGTCCCATCATTCAAAATGTTATGGGGGCTGA 
CCTGGTAAAATAACGCCTAAAACAGGGATGTCTAGTTTTTCTTTAATTTCTTGCCAGGC 



AMATRACMAS 



SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA 
GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA 
GTTAAATTAATTGATAGTGGCGCAGAAACCGTTCGTGATATTTCTGTTTTATTGAACTA 
ACTGCAGTTGCTGTATTACAAGCTATAACAATCATCTTAACATTTTTAGTCAATAAGAA 
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Table 18: Comparative Sequences relating to SAG1600 (glutamate racemase) 



SEQ1801 
SEQ18 02 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 

SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ1808 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 

SEQ1801 
SEQ1802 
SEQ1803 
SEQ1804 
SEQ1805 
SEQ1806 
SEQ1807 
SEQ180 8 
SEQ1809 
SEQ1810 
SEQ1811 
SEQ1812 
SEQ1813 
SEQ1814 



TTTGAGATAAACCATAATTGGCAAAATAAACACGGTGGTCATCACTTTTACACAACCGC 
TTTGAGATAAACCATAATTGGCAAAATAAACACGGTGGTCATCACTTTTACACAACCGC 
TTTGAGATAAMCCATAATTGGSMAAATAAACACGGTGGTCATCACTTTTACACAACCGS 
TTAACCATCTGCCAGGTAAACTCTCTAATCTGTTGAGCAGGTCTAGGACCATACGGAGC 

AGCCCAA 

AGCCCAAAAGGTTTTAAAGAAA 

AGCCCAAAAGGTTTTTAAGGAAATTGCAGAACAATGGCTTAATCAAGAAATAAAT 

CTAGCCTGATCTCCAATGAAGATTACTTCCTCTTCTGGAAGTTGACGGAACATTTCCTT 

ACAACCGTTAAACCACCT 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ ID NO. 1901: SAG1680 FROM THE 2603 V/R GBS TYPE V STRAIN 

ATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAAC 
CAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTA 
AACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACT 
ACCTTTATT TGAAACTGTTT TTTAATTTTATCTGATAAGTCAATGACC T TATCGTAATTTGAGCTGTTACGATTAAATAA 
TCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTA 
TTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAAACGT 
CCGGTT C C ACCTTGATTAACGATAGTATTTACAGC AC C C ACTAATTTAGCTTGAGGAGATAAATC ATCTAGCAAAGGGAT 
AACACTCTGTTTAAATGGC ATTGAAACATTAACAC C ACGAATACCCAATGC C CTGACAC C T CGAACAGCTTCTGTTAATT 
TAC C CTC TT CTACTTCAAATGTCAGATAGGC ATAATTC ATGTTTTTTTCTTGAAAAGAGGTATTC CACATTAACGGGGAT 
AGAGAGTGGCGTGCAGG 

SEQ ID NO. 1902: SAG1680 PROM THE H36b GBS TYPE lb STRAIN 

GTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGTCT 
AACAAATCGTAACAATGCTGTTtCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGAT 
CGTTAATTAAAC TATAATTATCTAATGGC CTC ATTC C TAAAC TAGTAGCATCAATATAAAAATGACTAGTTC TAATAGCG 
TCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTC 
AATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAA 
CTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTA 
AAAC AACC AATGC CATCTGTC ATATGGC CTACT AAACGTCCGGTTC CAC CTTGATTAACGATAGTATTTAC AGCACCCAC 
TAATTTAGCTTGAGGAGATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACCACGAA 
TACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTCAAATGTCAGATAGGCATAATTCATG 
TTTTTTTCTTGAAAAGAGGTATT C CACATTAACGGGGAT AGAGAGTGGC GTGCAGGA 

SEQ ID NO. 1903: SAG1680 FROM THE M732 GBS TYPE III STRAIN 

CTGGTCTAATTGCCAATCCTGCACGCCACTCTCTATCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAAT 
TATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGG 
TGTTAATGTTTCAATGCCATTTAAACAGAGTGTTATC CC TTTGCTAGATGATTTATCTC CTC AAGCTAAATTAGTGGGTG 
CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGATGGCATTGGTTGTTTTAAAGCT 
TTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATTACAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGT 
TCAAGCAGCTATGGAGGGAGTTGCGGAAATTAGATTATTTAATCGTAACAGCTCAAATTACGATAAGGTCATTGACTTAT 
CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAGACAGCATTTAAAGACGCTATT 
AGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGAATGAGGCCATTAGATAATTATAGTTTAATTAACGATCCAGA 
TATTTTAACACCGAATTTAGTAGTTGTCGACTT 

SEQ ID NO. 1904: SAG1680 FROM THE M781 GBS TYPE III STRAIN 

AAATCAGCATCCCTAGACATTATAAGCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCT 
TGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTC 
ATTCCTAAACTAGTAGCAT CAATATAAAAATGAC TAGTT CTAATAGCGTCTTTAAATGCTGTCTTATTTTC TAGATAATC 
AACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTGTTACGAT 
TAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATT 
GTAATTATTTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTAC 
TAAACGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGGAGATAAATCATCTAGCA 
AAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACCACGAATACTCAATGCCCTGACACCTCGAACAGCTTCT 
GTTAATTTACCCTCTTCTACTTCAAATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAA 
CGGGGATAGAGAGTGGCGTGCA 

SEQ ID NO. 1905: SAG1680 FROM THE 090 GBS TYPE la STRAIN 

GTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAACAGAGTGTTATCCCtTTGCTArA 
TGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGGTGGAACCGsACGTTTAGTAGGCC 
ATATGACAGATGGCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTTACAATAGCT 
GGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGAGTTGCGGAAATTAGATTATTTAATCGTAA 
TAGCTCAAATTACGATAAGGTCATTGACTTATCAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATC 
TAGAAAATAAGACAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGAATGArGCCA 
TTAGATAATTATAGTTTAATTAACGATCCAGAAATTTTAACACCCAATTTAGTAGTTGTCGACTTGGTTTACAAGCCTAA 
AGAAACAGCATTGTTACGATTTGTTAGACAAAATGGAGTGAAACATGCTTATAATGGTCTAGGGATGCTGATTTATCAAG 
GAGCAGA 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ ID NO. 1906: SAG1680 FROM THE A909 GBS TYPE la STRAIN 

CCCTAGACCATTATAATCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCA 
AGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAA 
CTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTAC 
CTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATC 
TAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATT 
TTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAAACGTCC 
GGTT CCAC CTTGATTAACGATAGTATTTACAGC ACCCACTAATTTAGCTTGAGGAGATAAAT CAT CTAGC AAAGGGATAA 
CACTCTGTTTAAATGGCATTGAAACATTAACACCACGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTA 
CCCT CTT CTACTT CAAATGTC AGATAGGCATAAT TC ATGT TTTTTTCTTGAAAAGAGGTATTC C ACATTAACGGGGATAG 

SEQ ID NO. 1907: SAG1680 FROM THE COH1 GBS TYPE la STRAIN 

TGCACGC C ACTCT CTATC C CCGTTAATGTGGAATACC TCTTTTAAGAAAAAAAC ATGAATTATGC C TATCTGACATTTGA 
AGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCCAT 
TTAAACAGAGTGTTATCC CTTTGCTAGATGATTTATCTC C TCAAGCTAAATTAGTGGGTGCTGTAAATACT 

SEQ ID NO. 1908: SAG1680 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTT 
GTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTGGGTGTTAAAATTTCT 
GGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAAACTAGTAGCATCAATATAAAAATGACTAGTTCTAAT 
AGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATA 
AGTCAATGACCTTATCGTAATTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAACTGCTTGAACT 
GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTT 

SEQ ID NO. 1909: SAG1680 FROM THE CJB110 GBS NONTYPEABIiE STRAIN 

ACTCTCTATCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAA 
GAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAACA 
GAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGGTG 
GAACCGGACGTTTAGTAGGCCATATGACAGATGGCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAA 
AATAAAATAGTTACAATAGCTGGTATTGGTG 

SEQ ID NO. 1910: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN 

ATTCGTTATTAATTGAAATGCTTCTGCTC CTTGATAAAT CAGCATCC CTAGACC ATTATAAGC ATGTTT C ACTC CATTTT 
GTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCT 
GGATCGTTAATTAAAC TATAATTAT C TAATGGC CT CATTC CTAAACTAGTAGCAT C AATATAAAAATGAC TAGTTCTAAT 
AGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATA 
AGTCAATGACCTTATCGTAATTTGAGCTGTTACGAT 

* 

SEQ ID NO. 1911: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN 

ACTTCTCTATTCCCCGTTAATGTGGAATACCTCTTTTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAG 
AAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCATTTAAA 
CAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGTGCTGTAAATACTATCGTTAATCAAGG 
TGGAACC 

SEQ ID NO. 1912: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN 

TCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCATCATCCCTAGACCATTATAAGCATGTTTCACTCCATTTTGT 
CTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACCAAGTCGACAACTACTAAATTCGGTGTTAAAATTTCTGG 
ATCGTTAATTAAACTATAATTATCTAATGGCCT CATTC CTAAAC TAGTAGC AT CAATATAAAAATGAC T AGTTCTAATAG 
CGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAG 
TCAATGACCTTATCGTAATTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAAC 

SEQ ID NO. 1913: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN 

ATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTAACAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGT 
GTTAATGTTTCAATGCCATTTAAACAGAGTGTTATC CC TTTGCTAGATGATTTAT CTC CTCAAGC TAAATTAGTGGGTGC 
TGTAAATACTAT CGTTAAT CAAGGTGGAAC CGGACGTTTAGTAGGC C ATATGACAGATGGC ATTGGTTGTTTTAAAGCTT 
TAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATTACAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTT 
CAAGCAGCTATGGAGGGAGTTGCGG 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ ID NO. 1914: SAG1680 FROM THE JM9130013 GBS TYPE VIII STRAIN 

CCCTAGACCATTATAAGTCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTCTTTAGGCTTGTAAACC 
AAGTCGACAACTACTAAATTGGGTGTTAAAATTTCTGGATCGTTAATTAAACTATAATTATCTAATGGCCTCATTCCTAA 
ACTAGTAGCATCAATATAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAACGACTA 
CCTTTATTT<3AAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTAATTTGAGCTATTACGATTAAATAAT 
CTAATTTCCGCAACTCCCTCCATAGCTGCTTGAACTGCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTAT 
TTTATTTTTAGCACTGAAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCAT 



SEQ1901 : ATCCCT 

SEQ1902 GTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGC ATCCCT 

SEQ1903 TGGTCTAATTGCCAATCCTGCACGCCACTCTCTAT- CCCCGTTAATGTGGAATACCTCT 

SEQ1904 AAATCAGCATCCCT 

SEQ1905 

SEQ1906 CCCT 

SEQ1907 TGCACGCCACTCTCTAT- CCCCGTTAATGTGGAATACCTCT 

SEQ19 0 8 ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCT 

SEQ1909 ACTCTCTAT - CCCCGTTAATGTGGAATACCTCT 

SEQ1910 -ATTCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCAGCATCCCT 

SEQ1911 ACTTCTCTATTCCCCGTTAATGTGGAATACCTCT 

SEQ1912 - - TCGTTATTAATTGAAATGCTTCTGCTCCTTGATAAATCATCATCCCT 

SEQ1913 

SEQ1914 CCCT 

SEQ1901 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1902 GACCATTATAAG-CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1903 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA 

SEQ1904 GAC - ATTATAAG- CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1905 

SEQ1906 GACCATTATAAT - CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1 907 TT - AAGAAAAAAACATGAATTATGC CTATCTGACATTTGAAGTAGAAGAGGGTAAATTA 

SEQ19 0 8 GACCATTATAAG - CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1 909 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA 

SEQ1910 GACCATTATAAG - CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1911 TTCAAGAAAAAAACATGAATTATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA 

SEQ1912 GACCATTATAAG- CATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1913 ATGCCTATCTGACATTTGAAGTAGAAGAGGGTAAATTA 

SEQ1914 GACCATTATAAGTCATGTTTCACTCCATTTTGTCTAACAAATCGTAACAATGCTGTTTC 

SEQ1901 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ 1 9 0 2 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ 1 9 0 3 CAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCC A 

SEQ1904 TTAGGCTTGTAAACCAAGTC- -GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ1 9 05 GTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA 

SEQ1 9 0 6 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ1907 CAGAAGCTGTTCGAGGTGTCAGGGCATTGAGTATTCGTGGTGTTAATGTTTCAATGCCA 

SEQ190 8 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTGGGTGTTAAAATTTCTGGATCG 

SEQ1909 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA 

SEQ1 9 1 0 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ1911 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA 

SEQ1 9 12 TTAGGCTTGTAAACCAAGTC - - GACAACTACTAAATTCGGTGTTAAAATTTCTGGATCG 

SEQ1913 CAGAAGCTGTTCGAGGTGTCAGGGCATTGGGTATTCGTGGTGTTAATGTTTCAATGCCA 

SEQ1914 TTAGGCTTGTAAACCAAGTC- -GACAACTACTAAATTGGGTGTTAAAATTTCTGGATCG 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ19 0 1 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT- AAACTAGTAGCATCAAT 

SEQ1 902 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT- AAACTAGTAGCATCAAT 

SEQ1903 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT 

SEQ1 9 0 4 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT - AAACTAGTAGCATCAAT 

SEQ1 9 0 5 TTTAAACAGAGTGTTATCCCTTTGCTARATGATTTATCTCCTCAAGCTAAATTAGTGGGT 

SEQ1 9 0 6 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT - AAACTAGTAGCATCAAT 

SEQ1 9 0 7 TTTAAAC AGAGTGTTATC C CTTTGCTAGATGATTTATCTC CTC AAGCTAAATTAGTGGGT 

SEQ19 0 8 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT- AAACTAGTAGCATCAAT 

SEQ1909 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT 

SEQ19 10 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT -AAACTAGTAGCATCAAT 

SEQ1911 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT 

SEQ19 12 TT - AATTAAACTATAATTATCT AATGGCCTCATTCCT - AAACTAGTAGCATCAAT 

SEQl 913 TTTAAACAGAGTGTTATCCCTTTGCTAGATGATTTATCTCCTCAAGCTAAATTAGTGGGT 

SEQ 1 9 14 TT - AATTAAACTATAATTATCT - AATGGCCTCATTCCT - AAACTAGTAGCATCAAT 

SEQ1901 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ19 0 2 TAAAAATGACTAGTTCTAATAGCGT CTTTAAATGC TGT CTTATTTTCTAGATAATCAAC 

SEQ1 9 0 3 CTGTAAATACTATCGTTAATCAAGGTGGAACCGGACGTTTAGTAGGCCATATGACAGAT 

SEQ1 9 0 4 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ1 9 0 5 CTGTAAATACTATCGTTAATCAAGGTGGAACCGSACGTTTAGTAGGCCATATGACAGAT 

SEQ1906 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ1907 C TGTAAATACT 

SEQ1 9 0 8 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ1 9 0 9 CTGTAAATACTATCGTTAATCAAGGTGGAAC CGGACGTTTAGTAGGC C ATATGACAGAT 

SEQ1 9 1 0 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ1 9 1 1 CTGTAAATACTATCGTTAATCAAGGTGGAACC 

SEQ1 9 12 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

SEQ1 9 13 C T GT AAAT AC TATCGT T AAT C AAGG T GG AAC C GG AC GT TT AGT AGG C C AT ATG AC AGAT 

SEQ1 9 14 TAAAAATGACTAGTTCTAATAGCGTCTTTAAATGCTGTCTTATTTTCTAGATAATCAAC 

S EQ1 9 0 1 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1 9 0 2 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQl 9 0 3 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATT 

S EQ1 9 0 4 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1 905 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTT 

SEQ 1 9 0 6 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1907 

SEQ 1 9 0 8 ACTAC CTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1 9 0 9 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAGTT 

SEQ 1 9 1 0 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1911 

SEQ1 9 12 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ1913 GCATTGGTTGTTTTAAAGCTTTAGCAGCTCAAGGTTTCAGTGCTAAAAATAAAATAATT 

SEQ1914 ACTACCTTTATTTGAAACTGTTTTTTAATTTTATCTGATAAGTCAATGACCTTATCGTA 

SEQ190 1 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC 

SEQ190 2 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC 

SEQ1 9 0 3 C AATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGA 

SEQ1 9 0 4 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC 

SEQ1 9 0 5 CAATAGCTGGTATTGGTGGTTCAGGTAAAGCAGTTGCAGTTCAAGCAGCTATGGAGGGA 

SEQ1 906 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC 

SEQ1907 

SEQ1908 TTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAACTGCTTGAAC 

SEQ1909 CAATAGCTGGTATTGGTG 

SEQ 1 9 1 0 TTTGAGCTGTTACGAT 

SEQ1911 

SEQ1912 TTTGAGCTGTTACGATTAAATAATCTAATTTCCGCAAC 

SEQ1913 C AAT AG C TGGTAT TGGTGGTTC AGGTAAAG C AGTTGC AGTT CAAGC AGCTATGGAGGGA 

SEQ1 9 14 TTTGAGCTATTACGATTAAATAATCTAATTTCCGCAACTCCCTCCATAGCTGCTTGAAC 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT 
GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT 
TTGCGGAAATTAGATTATTTAATCGTAACAGCTCAAATTACGATAAGGTCATTGACTTA 
GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT 
TTGCGGAAATTAGATTATTTAAT CGTAATAGC T CAAATTACGATAAGGTCATTGAC TTA 
GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAATTATTTTATTTTTAGCACT 

GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTT 

TTGCGG 

GCAACTGCTTTACCTGAACCACCAATACCAGCTATTGTAACTATTTTATTTTTAGCACT 



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ190 6 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



AAACCTTGAGCTGCTAAAGCTTTAAAACAAC C AATGC CATC TGTCATATGGCC TACTAA 
AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA 
CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAG 
AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA 
CAGATAAAATTAAAAAACAGTTTCAAATAAAGGTAGTCGTTGATTATCTAGAAAATAAG 
AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCATATGGCCTACTAA 



AAACCTTGAGCTGCTAAAGCTTTAAAACAACCAATGCCATCTGTCAT - TABCMARAT - - 



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



CGT CCGGTTCCACC TTGATT AACGATAGT ATTTAC AGCACCC ACTAATTTAGCTTGAGG 
CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG 
CAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGA 
CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG 
CAGCATTTAAAGACGCTATTAGAACTAGTCATTTTTATATTGATGCTACTAGTTTAGGA 
CGTCCGGTTCCACCTTGATTAACGATAGTATTTACAGCACCCACTAATTTAGCTTGAGG 



STNCSRATNGTSASHKMATDHYDRGNAS ■ 



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC 
GATAAATCATC TAGCAAAGGGATAAC ACTCTGTT TAAATGGCATTGAAAC ATTAAC ACC 
TGAGGCCATTAGATAATTATAGTTTAATTAACGATCCAGATATTTTAACACCGAATTTA 
GATAAATCATC TAGCAAAGGGATAAC ACTCTGTTTAAATGGCATTGAAACAT T AAC AC C 
TGARGCCATTAGATAATTATAGTTTAATTAACGATCCAGAAATTTTAACACCCAATTTA 
GATAAATCATCTAGCAAAGGGATAACACTCTGTTTAAATGGCATTGAAACATTAACACC 
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Table 19: Comparative Sequences relating to SAG1680 (shikimate 5-dehydrogenase) 



SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 

SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 

SEQ1901 
SEQ1902 
SEQ1903 
SEQ1904 
SEQ1905 
SEQ1906 
SEQ1907 
SEQ1908 
SEQ1909 
SEQ1910 
SEQ1911 
SEQ1912 
SEQ1913 
SEQ1914 



CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 

CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 
TAGTTGTCGACTT 

CGAATACTCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 
TAGTTGTCGACTTGGTTTACAAGCCTAAAGAAACAGCATTGTTACGATTTGTTAGACAA 
CGAATACCCAATGCCCTGACACCTCGAACAGCTTCTGTTAATTTACCCTCTTCTACTTC 

AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGG 
AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGG 

AATGTC AGATAGGC ATAATT CATGTTTTTTTCTTGAAAAGAGGTATT C CAC ATTAACGG 

ATGGAGTGAAACATGCTTATAATGGTCTAGGGATGCTGATTTATCAAGGAGCAGA 

AATGTCAGATAGGCATAATTCATGTTTTTTTCTTGAAAAGAGGTATTCCACATTAACGG 

GATAGAGAGTGGCGTGC AGG - 
GATAGAGAGTGGCGTGCAGGA 

GATAGAGAGTGGCGTGCA 

GAT AG 
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Table 20: Comparative Sequences relating to SAG1723 (signal peptidase I) 



SEQ ID NO. 2001: SAG1723 FROM THE COHl GBS TYPE la STRAIN 

ATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAA 

ATCAAATATAAAAATGACAC C T TAACT ATTAAGAATAAAAAAACAGAAGAAC C T TAC C T CAAGGAATATACTAAAT TAT T TAAAAAGGA 
TAAATTACAGGAAAAATATTCGTATAACCC!ACTTTTCCIA^ 

GCX3AATTTACTACTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGTTCC 
TTCAAAA 

SEQ ID NO, 2002: SAG1680 FROM THE CJBllO GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

TAAAGTTGACGGACACTCCATGGATCCAACT^ 

TTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTC^^ 
AATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTT^ 

AAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACTA^ 

CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGTCCCTTCAAAAAATCA 
ACAATTGTGGGAG 

SEQ ID NO- 2003: SAG1680 FROM THE 18RS21 GBS TYPE II STRAIN 

TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCT^ 

GTGGCTAACGAAGAAGAAGG CGGC CAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCAT CAAATATAAAAATGA 
CACC TTAACTAT TAACAATAAAAAAACAGAAGAACCT TAC CT CAAGGAATATACTAAAT TAT T T AAAAAGGATAAATTACAGGAAAAAT 
ATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTC^ 

GTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGTCCCTTCAAAAAATCAACGAT 
TGTGGGAGAGGT 

SEQ ID NO. 2004: SAG1680 FROM THE 2603 V/R GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

AAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAGT^ 
GTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGT 

TGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATACTAAATTATTTAA 
AATATTCGTATAACCCACTTTTCGAAGACCTAGCACAAAGCTCTACCGCTTT 

GTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGT 

SEQ ID NO. 2005: SAGI680 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

TTGACGGACACTCCATGGATCCAACTTTAGCTGAC 

GGCTAACGAAGAAGAAGGCGGCC1AAAAGAAAAAAATTGTTAAACGTGTCATTGGTATC 
CCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCT(^GGAATATACTAAATTATTTAAAAAGGAT 
TCGTATAACCC^CTTTTCCAAGACCTAGCACAAAGCT^ 
GCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA 

SEQ ID NO. 2006: SAG1680 FROM THE M781 GBS TYPE III STRAIN 

TTGACGGACACTCCATGGATCC^^CTTTAGCTGA^ 
GTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAA 
CACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAA 
TATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACC^ 

SEQ ID NO. 2007: SAG1680 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

TTGGTAAAGTTGACGGACACTCCATGGATCCAACTTTAG 

GATATTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAA^ 

TAAAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCA 

AGGAAAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCT^ 

ACCACTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGCCCCTTCAAAAA 
ATCAACG 

SEQ ID NO. 2008: SAG1680 FROM THE H36b GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

TTGACGGACACTCCATGGATCCAACTTTAGCTGAC^ 

GTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCA 
CACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTaAAGGAATATACTAAATTATTTAAAAA 
ATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACCA 
GTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA 

SEQ ID NO. 2 009: SAG1680 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

TAAAGTTGACGGACACTCCATGGATCCAACTTTAGC^ 

TTGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCIATTGGTATGCCAGGTGATGTCAT 
AATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCT CAAGGAATATACTAAATTAT TTAAAAAGGATAAATTACAGGA 
AAAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCa 

CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCGGT 
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SEQ ID NO- 2010: SAG1680 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

AAAGTTGACG<^CACTCGA.TGGATC 

TGTAGTGGCTAACGAAGAAGAAGGCGGCCAAAAGAAAAAAATTGTTAAACGTGTCATT 

ATGA(^CCTTAACTATTAACAATAAAAAAA(^GAAGAACCTTACCTCAAGGAATATACTAAATTATTT 

AAATATTCGTATAACCCACTTTTCCAAGACCTAGCACAAAGCTCTACCGCTTTCACCA 

TGTCGTGCCTAAAGGCC^CTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGATAGTCGTGCCGTCG^ 
CG 



SEQ2001 

SEQ2 002 TAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 003 TTGACGGAC ACTC CATGGATC CAACTTTAGC TGACAAGGAACAGC TAGTAG 

SEQ2 004 AAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 005 TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 006 TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 007 TGGTAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 008 • TTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2 009 TAAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2010 AAAGTTGACGGACACTCCATGGATCCAACTTTAGCTGACAAGGAACAGCTAGTAG 

SEQ2001 , ATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2002 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2003 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2004 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2 005 TCTC AAACAAACAAAA - - TAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2 006 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2 007 T C T C AAAC AAAC AAAAAT C AAT C GAT T CG AT AT TGTAGTGGC T AACGAAG AAGAAGG CG 

SEQ2 008 T C T C AAAC AAAC AAAAAT C AAT CGAT T C GAT AT TGT AGTGG C T AAC GAAGAAG AAG G C G 

SEQ2 009 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2010 TCTCAAACAAACAAAAATCAATCGATTCGATATTGTAGTGGCTAACGAAGAAGAAGGCG 

SEQ2 001 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 002 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 003 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 004 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 005 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 006 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 007 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 008 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 009 GCCAAAAGAAAAAAATTGTTAAACGTGT CATTGGTATGCCAGGTGATGTC AT CAAATATA 

SEQ2 010 GCCAAAAGAAAAAAATTGTTAAACGTGTCATTGGTATGCCAGGTGATGTCATCAAATATA 

SEQ2 001 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA 

SEQ2 002 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA 

SEQ2 003 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA 

S EQ2 004 AAAATGAC ACCTTAACTATTAACAATAAAAAAACAGAAGAAC C TTAC CTCAAGGAATATA 

SEQ2005 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTCAAGGAATATA 

S EQ2 0 0 6 AAAATGACACCTTAACTATTAACAATAAAAAAACAGAAGAACCTTACCTGAAGGAATATA 

SEQ2007 AAAATGACAC CTTAACTATTAACAATAAAAAAAC AG AAGAACCTTAC CTCAAGGAATATA 

SEQ2 008 AAAATGACAC CTTAACTATTAAC AATAAAAAAACAGAAGAACC TTAC C TCAAGGAATAT A 

SEQ2 009 AAAATGACACCTTAACTATTAAC AATAAAAAAACAGAAGAACC TTAC CTCAAGGAATATA 

SEQ2010 AAAATGACAC CTTAACTATTAACAATAAAAAAACAG AAGAACC TTACCT CAAGGAATATA 

SEQ2 001 CTAAATTATTT- AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA 

SEQ2 002 CTAAATTATTT - AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA 

SEQ2003 CTAAATTATT T - AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCC ACTTTTCC AA 

SEQ2004 CTAAATTATTT - AAAAAGG AT AAAT T AC AGGAAAAAT AT T C GT AT AAC C C AC T T T T C C AA 

SEQ2 005 CTAAATTATTT - AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA 

SEQ2 006 CTAAATTATTTTAAAAAGGATAAATTAC AGGAAAAAT ATTCGTAT AAC C C ACT TTT CCAA 

SEQ2007 CTAAATTATTT - AAAAAGGATAAATTAC AGGAAAAATATTCGTAT AAC C C AC TT TTCCAA 

SEQ2 008 CTAAATTATT T - AAAAAGGATAAATTAC AGGAAAAATATTCGT AT AACC CAC TTTTCCAA 

SEQ2009 CTAAATTATTT - AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA 

SEQ2010 CTAAATTATTT - AAAAAGGATAAATTACAGGAAAAATATTCGTATAACCCACTTTTCCAA 
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SEQ2001 
SEQ2002 
SEQ2003 
SEQ2 004 
SEQ2005 
SEQ2006 
SEQ2007 
SEQ2 008 
SEQ2009 
SEQ2010 

SEQ2001 
SEQ2002 
SEQ2003 
SEQ2004 
SEQ2005 
SEQ2 006 
SEQ2 007 
SEQ2008 
SEQ2009 
SEQ2010 

SEQ2 001 
SEQ2 002 
SEQ2003 
SEQ2004 
SEQ2005 
SEQ2 00 6 
SEQ2007 
SEQ2008 
SEQ2009 
SEQ2010 

SEQ2001 
SEQ2002 
SEQ2003 
SEQ2004 
SEQ2005 
SEQ2006 
SEQ2007 
SEQ2008 
SEQ2 009 
SEQ2010 



Table 20: Comparative Sequences relating to SAG1723 (signal peptidase I) 



GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT 
GAC CT AG C AC AAAGCT CTAC CGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACT 
GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT 
GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT 
GAC CT AGC ACAAAGCTCTAC CGCTTT CAC CACTGACAGCAATGGCAGCAGCGAATTTACT 
GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT 
GACCTAGCACAAAGCTCTACCGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACC 
GACCTAGCACAAAGCTCTACCGCTTTCACCACTGACAGCAATGGCAGCAGCGAATTTACT 
GACCTAGCACAAAGCTCTACCGCTTTCACTACTGACAGCAATGGCAGCAGCGAATTTACT 
GAC CTAGC AC AAAGCT CTAC CGC TTT CACCACTGACAGCAATGGC AGCAGCGAATTTACT 

CTGTCGTGCCTAAAGGC C ACTACTAT CTTGTTGGTGATGAC CGAATTGTCTCTAAAGAT 
CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT 
CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT 
CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT 
CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGA 



CTGTCGTGCCTAAAGGCCACTACTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT 

CTGT CGTGC CTAAAGGCCACTACTATCTTGT TGGTGATGACCGA 

CTGTCGTGCCTAAAGGCCACTATTATCTTGTTGGTGATGACCGAATTGTCTCTAAAGAT 
CTGTCGTGC C TAAAGGC CACTACTATCTTGTTGGTGATGAC CGAATTGTCTCTAAAGAT 

GTCGTGCCGTCGGTTCCTTCAAAA 

GTCGTGCCGTCGGTCCCTTCAAAAAATCAACAATTGTGGGAG 

GTCGTGCCGTCGGTCCCTTCAAAAAATCAACGATTGTGGGAGAGGT 

GTCGTGCCGTCGGT 



GTCGTGCCGTCGGCCCCTTCAAAAAATCAACG 



GTCGTGCCGTCGGT 

GTCGTGCCGTCGGTCCCTTCAAAAAATCAACGTABCMARATVSTNCSRATNGTSAGSGN 



TDAS 
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Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase) 

SEQ ID NO. 2101: SAGO 07 9 FROM THE 2603V/R GBS TYPE V STRAIN 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTC1AAGCAGCTAAGATCGTTGAAGAATTT 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATAT 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTT 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CCATCMGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACrG^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCA^ 

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAG^ 

GAAAAAGCGTTG 

SEQ ID NO. 2102: SAG0079 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAG 
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAA^ 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGC^ 

CGTACTATTGAACAAGCA(^CGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGA 
CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAA^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGC^ 

GAACCTATTCTTGAAC^CTATCGTAAGCTTGGTCTTGTTACAGATATTGAAG 

GAAAAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 2103: SAG0079 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

TGGTAAAGGGACTCAAGCAGCTAAGATTGTT 

TAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGATCAA 

AGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGGTATCCACGTACTATTGAACAAGCA 

TGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTTATAGAGCGTTTGAGTGG 

TCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTGTTCAACCCA 

TGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGA 

TGGCCTTGTTACAGATATTGAAGGTAATCAAGAAATAA 



SEQ ID NO. 2104: SAG0079 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAACCACGGGTTCGCCTGGTGCTGGTAAAGGTACTCAAGC 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTT 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGA 

CGTACTATTGAACAAGCAC1ACGCCTTAGATGCTACGC 

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATC^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGAT^ 

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGA^ 
GAAAAAGCGTTGCTAGAA 

SEQ ID NO. 2105: SAG0079 FROM THE 2603V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTC^CATCTCA 
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAAC^ 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGX3ATGATATCGCAGAAAAAGGTTTTTTACTT 

CGTACTATTGAACAAGCAC^CGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAAC 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTC^ 

GAACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAG^ 

GAAAAAGCGTTG 

SEQ ID NO. 2106: SAG0079 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGG 

ACAGGGGATATGTTCCGCGCCGa^TGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGT^ 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCA 

CGTACTATTGAAO^AGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CCATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCAC^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAAro 

GAATCTATTCTTGAACACTATCGAAAGCTTGGTCTTGTTACAGATATTGAAGGTAA 

SEQ ID NO. 2107: SAG0079 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAACCACGGGTTTGCTTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTTGAAGAATTTGGTGTTGCTCACATCTCA 

ACAGGGGATATGTTCCGCGCCGC71ATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAA 

CCTGATaA^GTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGATGGATATCCA 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CC^TCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGA^ 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCIAAGGA 
GAACCTATTCTTGAACACTATAG 
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Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase) 

SEQ ID NO. 2108: SAGO 07 9 FROM THE COH1 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCA^ 

<^G<3<^ATATGTTCCGCGCCGCAATGGCTAATCAAACC 

CTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCT^ 

GTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAA 

CAACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAA^ 

ATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAAC^ 

AACCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTAC^GATATTGAAG^ 

AAAAAGCGTTGCTAG 

SEQ ID NO. 2109: SAG0079 FROM THE H36b GBS TRYP lb STRAIN (REVERSE COMPLEMENT) 

CAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTC 
CTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATA 

GTACTATTGAACAAGCACIACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTG 
CATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATCAATCGTA 

ATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATT 

AATCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGAT^ 

AAAAAGCGTTGCT 

SEQ ID NO. 2110: SAG0079 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE 
COMPLEMENT) 

AATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACT 

ACAGGGGATATGTTCCGCGCCGCAATGGCTAATCAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTT 
CCTGATGAAGTAAGAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATC^ 

CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 
CQATCATGTCTTATAGAGCGTTTGAGTGGTCGTATTATC^ 
GATTATAAAGAAGAAGATTACTATC^CGTGAAGATGATAAGCCTC 
GAACCTATTCTTGAACACTATAAAAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCA 

SEQ ID NO. 2111: SAGO 07 9 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACT 

GGGGATATGTT CCG CGCCGCAATGGCTAATCAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGT TCCT 
GATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAG 
ACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAAGAAC 
ACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACT 

TATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCT 

CCTATTCTTGAACACTATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGT^ 

AAAGCGTTGCTAGAACTCAAA 

SEQ ID NO. 2112: SAG0079 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

AATCTTTTAATTACGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGC^ 
ACAGGGGATATGTTCCGCGCCGCAATGGCTAATG 

CCTGATGAAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGTTTTTTACTTGAT 

CGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTTGAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGAT 

CCAACATGCCTTATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAA 

GATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCTGAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAA 



SEQ2 101 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2 102 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2103 TG GTAAAGGGAC T CAAG CAG C TAAGATTGTT 

SEQ2104 ATCTTTTAACCACGGGTTCGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2105 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2106 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2107 ATCTTTTAACCACGGGTTTGCTTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2108 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT 

SEQ2109 

SEQ2 11 0 ATCTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATCGTT 

SEQ2111 -- CTTTTAATTATGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT 

SEQ2 112 ATCTTTTAATTACGGGTTTGCCTGGTGCTGGTAAAGGTACTCAAGCAGCTAAGATTGTT 

SEQ2 101 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2 102 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2 10 3 AAGAATTTGGTGTTGCGCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2104 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ 2 105 AAGAATTTGGTGTTGCTCACATCTCIAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2106 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2107 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

SEQ2 10 8 AAGAATTTGGTGTTGCTCACATCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 
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SEQ2109 
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SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ210 6 
SEQ2107 
SEQ2108 
SEQ2109 
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
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SEQ2109 
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SEQ2111 
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CAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

AAGAATTTGGTGTTGCTCACATCTCAAC^GGGGATATGTTCCGCGCCGCAATGGCTAAT 

AAGAATTTGGTGTTGCTCAC^TCTCAACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 
AAGAATTTGGTGTTGCT^C^TCTOVACAGGGGATATGTTCCGCGCCGCAATGGCTAAT 

CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACC CAAATGGGACGTTTAGCTAAAAGTTATAT TGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCGAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 
CAAACCCAAATGGGACGTTTAGCTAAAAGTTATATTGATAAAGGTGAATTGGTTCCTGAT 

AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAA CAAACGGGAT TGTAAAAGAG CG CT TAGCTGAGGATGATAT CG CAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAA CAAACGGGAT TGTAAAAGAG C G CT T AG C TGAG GATGATAT CG CAGAAAAAGGT 
AAGTAA CAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 
AAGTAACAAACGGGATTGTAAAAGAGCGCTTAGCTGAGGATGATATCGCAGAAAAAGGT 

TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGGTATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCC^CGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAACAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT 
TTTTTACTTGATGGATATCCACGTACTATTGAGCAAGCACACGCCTTAGATGCTACGCTT 

GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCATCATGTCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT 
GAAGAACTAGGACTACGCTTAGATGGTGTTATTAATATTAAAGTGGATCCAACATGCCTT 

ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATaAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGTCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
ATAGAGCGTTTGAGTGGCCGTATTATCAATCGTAAAACTGGTGAAACTTTCCACAAAGTG 
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Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase) 



SEQ2X01 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ210 6 
SEQ2107 
SEQ2108 
SEQ210 9 
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ210 8 
SEQ210 9 
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ2108 
SEQ210 9 
SEQ2110 
SEQ2111 
SEQ2112 

SEQ2101 
SEQ2102 
SEQ2103 
SEQ2104 
SEQ2105 
SEQ2106 
SEQ2107 
SEQ2108 
SEQ2X09 
SEQ2110 
SEQ2111 
SEQ2112 



TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTC^^CCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 
TTCAACCCACCAGTAGATTATAAAGAAGAAGATTACTATCAACGTGAAGATGATAAGCCT 

GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAAGTGTCAAACGTCGCTTGGACGTTCATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAATCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAATCTATTCTTGAACAC 
GAAACTGTTAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAAGGAGAACCTATTCTTGAACAC 
GAAACTGTCAAACGTCGCTTGGACGTTAATATTGCTCAATABCMARATVSTNCSR AT 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 

ATAGTAAGCTTGGCCTTGTTACAGATATTGAAGGTAATCAAGAAATAA 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 

ATCGAAAGCTTGGTCTTGTTACAGATATTGAAGGTAA 

ATAG 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
ATCGTAAG CTT GGT CT TGTTACAGATAT TGAAGGTAAT CAAGAAATAA CAGAAGT T T T T 

ATAAAAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCA 

ATCGTAAGCTTGGTCTTGTTACAGATATTGAAGGTAATCAAGAAATAACAGAAGTTTTT 
GTSAGADNYATKNAS 

CAGATGTTGAAAAAGCGTTG 

CAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 



CAGATGTTGAAAAAGCGTTGCTAGAA 
CAGATGTTGAAAAAGCGTTG 



CAGATGTTGAAAAAGCGTTGCTAG 
CAGATGTTGAAAAAGCGTTGCT - - 



CAGATGTTGAAAAAGCGTTGCTAGAACTCAAA 
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Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase) 

>SEQ IB NO 2150:090 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRIxAKSYIDKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERIjSGRIXNRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRIjDVNIAQGEPILEH 
YRKLGLVTDI EGNQEI TEVFADVEKALLELK 

>SEQ ID NO 2151:114_1169NT frame: 2 

GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPDQVTNGIVKER 
IiAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRIIN 
RKTGETFHKVFNPP VD YKEEDY YQREDDKPETVKRRLDVHIAQGEP ILEHYSKLGLVTD I 
EGNQEI 

>SEQ ID NO 2152: 114__18RS21 frame: 1 

NLLTTGSPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGEIiVPD 

EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH 
YRKLGLVTDI EGNQE I TEV FADVE KALLE 

>SEQ ID NO 2153: 114_2603 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 
EVTNGI VKERLAEDDI AEKGFLLDG YPRTI EQAHALDATLEELGLRLDGVI NI KVDPS CL 
I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPEWKRRLDVNI AQGEPILEH 
YRKLGLVTDI EGNQE I TEVFADVEKAL 

>SEQ ID NO 2154: 114_A909 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 

EVTNGIVKERIAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

IERLSGRI INRKTGETFHKVFNPPVD YKEEDYYQREDDKPETVKRRLDVNIAQGES ILEH 
YRKLGLVTDI EG 

t 

>SEQ ID NO 2155:114_A909 frame: 1 

NLL IMGLPGAGKGTQAAKI VEE FGVAH I S TGDMFRAAMANQTEMGRLAKS Y I DKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 
IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH 
YRKLGLVTDI EG 

>SEQ ID NO 2156: 114_CJB110 frame: 1 

NLLTTGLLGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 

EVTNGIVKERLiAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINI KVDPS CL 

IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGEPILEH 
Y 

>SEQ ID NO 2157: 114_COHl frame: 3 

LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTQMGRLAKSYIDKGELVPDE 
VTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCLI 
ERLSGRI INRKTGETFHKVFNPPVD YICEED YYQREDDKPETVKRRLDVNIAQGEP ILEHY 
RKLGLVTD I EGNQE I TEVFADVEKALL 

>SEQ ID NO 2158: 114JH36B frame: 3 

GDMFRAAMANQTEMGRLAKSYIDKGELVPDEVTNGIVKERLAEDDIAEKGFLLDGYPRTI 
EQAHALDATLEELGLRLDGVINIKVDPSCLIERLSGRI INRKTGETFHKVFNPPVD YKEE 
DYYQREDDKPETVKRRLDVNIAQGESILEHYRKLGLVTDIEGNQEITEVFADVEKAL 

>SEQ ID NO 2159: 114_JM9130013 frame: 1 

NLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD • 
EVTNGI VKERLAEDDI AEKGFLLDGYPRTI EQAHALDATLEELGLRLDGV INI KVDPSCL 

I ERLSGRI INRKTGETFHKVFNPPVDYKEED YYQREDDKPETVKRRLDVNIAQGEP ILEH 
YKKLGLVTDI EGN 

>SEQ ID NO 2160:114_M732 frame: 1 

LL I MGL PGAGKGTQAAKI VEEFGVAH I STGDMFRAAMANQTQMGRLAKS YI DKGELVPD E 

VTNGI VKERLAEDD I AEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL I 

ERLSGRI INRKTGETFHKVFNP PVDYKEED YYQREDDKPETVKRRLDVNI AQGEPI LEHY 
RKLGLVTDIEGNQEITEVFADVEKALLELK 

>SEQ ID NO 2161: 114_M7 81 frame: 1 

NLL I TGLPGAGKGTQAAKI VEEFGVAH I S TGDMFRAAMANQTQMGRLAKS Y I DKGELVPD 
EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL 
I ERLS GRI I NRKTGET FHKVFNP PVDYKEED YYQR EDDKPETVKRRLDVNIAQ 
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Table 21: Comparative Sequences relating to SAG0079 (adenylate kinase) 



SEQ2 15 0 LLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 

SKQ2 151 GKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKS YIDKGELVPD 

SEQ2 152 LLTTGS PGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTEMGRLAKS YIDKGELVPD 

SEQ2153 LLIMGL PGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTEMGRLAKS YIDKGELVPD 

SEQ2154 LLIMGLPGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTEMGRLAKS YIDKGELVPD 

SEQ2155 LLIMGLPGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTEMGRLAKS YIDKGELVPD 

SEQ2156 LLT TGLLGAGKGTQAAKI VEE FGVAHI STGDMFRAAMANQTEMGRLAKS YI DKGELVPD 

SEQ2157 LL I MGL PGAGKGTQAAKI VEEFGVAHI STGDMFRAAMANQTQMGRLAKSYI DKGELVPD 

SEQ2158 GDMFRAAMANQTEMGRLAKSYI DKGELVPD 

SEQ2 15 9 LLIMGLPGAGKGTQAAKI VEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPD 

SEQ2 160 LLIMGLPGAGKGTQAAKIVEEFGVAHI STGDMFRAAMANQTQMGRLAKSYI DKGELVPD 

SEQ2161 LL I TGLPGAGKGTQAAKIVEEFGVAHI STGDMFRAAMANQTQMGRLAKSYI DKGELVPD 

SEQ2 15 0 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2 151 QVTNGIVKERIiAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2 15 2 EWNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2 15 3 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2154 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2155 EVTNGIVKERIiAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ2 15 6 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIK\TDPSCL 

SEQ2157 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL 

SEQ2158 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHAIiDATLEELGLRLDGVINIKVDPSCL 

SEQ2159 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCL 

SEQ216 0 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL 

SEQ2161 EVTNGIVKERLAEDDIAEKGFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPTCL 

SEQ2150 I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH 

SEQ2151 I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVHI AQGEPI LEH 

SEQ2 152 I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGEPILEH 

SEQ2 15 3 I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGEPILEH 

SEQ2 154 IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH 

SEQ2 15 5 IERLSGRI INRKTGETFHKVFNPPVD YKEEDYYQREDDKPEWKRRLDVNIAQGES ILEH 

SEQ2 15 6 IERLSGRI INRKTGETFHKVFNPPVD YKEEDYYQREDDKPETVKRRLDVNI AQGEP I LEH 

SEQ2 157 IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGEPILEH 

SEQ215 8 IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQGESILEH 

SEQ2159 IERLSGRI INRKTGETFHKVFNPPVD YKEEDYYQREDDKPETVKRRLDVNI AQGEP ILEH 

SEQ2160 I ERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNI AQGEPILEH 

SEQ2 16 1 IERLSGRI INRKTGETFHKVFNPPVDYKEEDYYQREDDKPETVKRRLDVNIAQ 

SEQ2 15 0 RKLGLVTD IEGNQE ITEVFADVEKALLELK 

SEQ2151 SKLGLVTD IEGNQE I 

SEQ2 15 2 RKLGLVTD I EGNQE ITEVFADVEKALLE - - 

SEQ2153 RKLGLVTD I EGNQE I TEVFADVEKAL 

SEQ2154 RKLGLVTD I EG 

SEQ2155 RKLGLVTDIEG 

SEQ2156 

SEQ2157 RKLGLVTD I EGNQE I TEVFADVEKALL 

SEQ2 15 8 RKLGLVTD I EGNQE I TEVFADVEKAL 

SEQ2159 KKLGLVTD I EGN 

SEQ2160 RKLGLVTD I EGNQE ITEVFADVEKALLELK 

SEQ2161 
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Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 

SEQ ID NO. 2201: SAG0093 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

AAGCCTAACAGTCAACAATCATCATCT CAAAAGTTGAGGAATG^ 
CAATTACCAGCTGTATCATCAAAA 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGC2AAGCTACTCAGTTTTTAGA 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTC^^ 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAG^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTA 

ATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2202: SAG0093 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT ) 

AAGCCTAACAGTCAACAATCATCACCTCAAAAGTTGAGGAATGAGGA 

CGATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTGAATCGTGACCATAAACATGAAGAATTAAGTCC^ 
CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTC^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTAT^ 
TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGGkGGTG 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAG^TAGCTCCACAATA 
CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGA^ 

ATGGCCGAACATCGTTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 
SEQ ID NO. 2203: SAG0093 FROM THE 18RS21 GBS TYPE II STRAIN 

AAGCCTAACAGTCAACAATCAT CATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATATCCT CTCAAAAAAGAAATAAGAAATTA 

CAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACCAT^ 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAG^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGAG^ 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 
CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAA 

ATGGCCAAACATCATTTAACATTAGAAaAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2204: SAG0093 FROM THE 2603V/R GBS TYPE V STRAIN 

ACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAA 

CAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGT 

AAAATATTTATTTGGATAAACGTATTACGAAGCAAGCT^ 

TTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAG^ 
GGGGACAAGC^GAAAAGTTGGTAAAAACTTACTCTCAGCCTGCIAGGTGCTAGTGAACA 
CTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCIAGTCAGTTGAAAA 
CGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTT 

AACATC!ATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAAAACCCAGCTTTCTTGTACAA 

SEQ ID NO. 2205: SAGO 093 FROM THE A909 GBS TYPE la STRAIN 

AA.GCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAA 
CGATTACCAGCTGTATCATCAAAAGATTG£?AACTTGATTT^ 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTGCTAGA 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTC^^ 

TTGACGAAGGAACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCT^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAAC^GCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGT 

ATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2206: SAG0093 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AAGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAA 

ACAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAAT^ 

TCCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAG 

ACATTTAATTTCGGGTTATCGTAGTGTTGCCTATC^GGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAA 
TTTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACC^ 

TATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTT 
ACGGTTTCCGGATGGTAAAACAGC^GAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGTCTGCAAAATA 
TATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 



580 



WO 2004/018646 



PCT/US2003/026827 



Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 

SEQ ID NO. 2207: SAGO 093 FROM THE COH1 GBS TYPE III STRAIN 

CCTAACAGTC!AA CAAT CAT CAT CTCAAAAGTTGAGGAATC 

ATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCIAATCGTG^ 

TGTTGAAAATATTTATTTGGATAAACGTATTACGAAGC^GCTACTC^ 

TTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTT^ 

GACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCT 

GAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCC^CAATATGGTTTTGTC 
GTTTCCGGATGGTAAAAC^GCAGAAACAGGGGTAGGTTATGAAGATTGGC^ 

GGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAAT^ 

SEQ ID NO. 2208: SAG0093 FROM THE H36b GBS TYPE lb STRAIN 

AAGCCTAACAGTCAACAATCATCATCTCAAAAG 
CGATTACCAGCTGTATCATCAAAAGATT^ 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACT 
CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTC^ 
TTGACGAAGGAACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^ 
ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTC^ 

CGGTTTCCGGATGGTAAAAC^GCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTT 
ATGGCCAAACATCAT T TAACAT TAGAAGAATA CATAAC T T TAT TAAAGGAGAATAAC CAA 

SEQ ID NO. 2209: SAG0093 FROM THE JM9130013 GBS TYPE VIII STRAIN 

AAGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAA 

CAATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTG^ 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGC^^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCTTATGTTACTCAAGAGA 
TTGACGAGGGGAGAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGC^ 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTG 

ATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 

SEQ ID NO. 2210: SAG0093 FROM THE M732 GBS TYPE III STRAIN 

AGCCTAACAGTCAACAATCATCATCTCAAAAGTTC 
GATTACCAGCTGTATCATCAAAAGATTGGAACTTGAT 

CTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTT^ 

ATTTAAT^TCGGGTTATCGTAGTGTTGCCTATCAGGAGAAGTTGTTCAATTCT^ATGTTACTCAAGAGATGACTA 
TGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAG^ 

TGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTAC 

GGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGT 

TGGTCAAACIATC^TTTAACATTAGAAGAATACATAACTTT^ 

SEQ ID NO. 2211: SAG0093 FROM THE M781 GBS TYPE III STRAIN 

AAGCCTAACAGTC1AACAATCATCATCTCAAAAGTTGAGGAATGA 

CGATTACCAGCTGTATCATCAAAAGATTGGAACTTGATTTTGGTCAATCGTGACCATAAACATGAAG 

CCTGTTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTT^ 

CATTTAATTTCGGGTTATCGTAGTGTTGCCTATC^GGAGAAGTTGTTC^ 

TTGACGAGGGGACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCA 

ATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAGTCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTA 

CGGTTTCCGGATGGTAAAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGG 

ATGGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGGAGAATAACCAA 



SEQ2 201 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA 

SEQ2 202 AGCCTAACAGTCAACAATCATCACCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA 

SEQ2 203 AGCCTAACAGTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA 

SEQ2204 ACAGT CAA CAAT CAT CAT CT CAAAAGT TGAGGAATGAGGATATAAAAAAGATA 

SEQ2 205 AGCCTAACAGTCAACAATCATCAT CTCAAAAGTTGAGGAATGAGGATATAAAAAAGACA 

SEQ2 206 AGCCTAAC^GTCAACAATCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGATA 

SEQ2 207 - - CCTAACAGTCAACAATC^TC^TCTaiAAAGTTGAGGAATGAGGATATAAAAAAGAC^ 

SEQ2 208 AGCCTAAC^GTCAA(^^TCATCATCTCAAAAGTTGAGGAATGAGGATATAAAAAAGACA 

SEQ2 209 AG C CTAACAGT CAA CAAT CAT CAT CT CSAAAGT TGAGGAATGAGGATATAAAAAAGATA 

SEQ2 210 AGCCTAACAGTCAACAATCATGATCTCAAAAGTTGAGGAATGA 

SEQ2 211 AG C C TAACAGT CAA CAAT CAT CAT CT CAAAAGT TGAGGAATGAGGATATAAAAAAGACA 
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Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ22 05 
SEQ22 0 6 
SEQ2207 
SEQ2208 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ22 01 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ22 05 
SEQ22 06 
SEQ2207 
SEQ2208 
SEQ220 9 
SEQ2210 
SEQ2211 

SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ22 06 
SEQ22 07 
SEQ22 08 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ2201 
SEQ22 02 
SEQ2203 
SEQ2204 
SEQ22 05 
SEQ2206 
SEQ2207 
SEQ2208 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ2201 
SEQ2202 
SEQ2203 
SEQ22 04 
SEQ2205 
SEQ22 06 
SEQ2207 
SEQ220 8 
SEQ2209 
SEQ2210 
SEQ2211 



TCCTCTCAAAAAAGAAAT -AAGAAATT -ACAATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACGATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACAATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT-AAGAAATT-AC^AATTACCIAGCTGTATC^ 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACGATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT -AAGAAATTTACAATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAATTAAGAAATT-ACGATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACGATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACAATTACCAGCTGTATCATCAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACGATTACCAGCTGTATCAT CAAAAGATTGGA 
TCCTCTCAAAAAAGAAAT - AAGAAATT - ACGATTACCAGCTGTATCATCAAAAGATTGGA 

ACTTGATTTTGGTCAATCGTGACGATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTC^TCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTTCCTG 
ACTTGATTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 
ACTTGA.TTTTGGTCAATCGTGACCATAAACATGAAGAATTAAGTCCAGATGTGGTGCCTG 

TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 
TTGAAAATATTTATTTGGATAAACGTATTACGAAGCAAGCTACTCAGTTTTTAGAGGCTG 

CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 
CTAGAGCAATTGATTCACGAGAACATTTAATTTCGGGTTATCGTAGTGTTGCCTATCAGG 

AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAAATGACTAGTAACCCTAATTTGACGAAGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAWGAAATGACTAGTAACCCTAATTTGACGAAGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
AGAAGTTGTTCAATTCTTATGTTACTCAAGAGATGACTAGTAACCCTAATTTGACGAGGG 
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Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ22 04 
SEQ22 0 5 
SEQ22 0 6 
SEQ2207 
SEQ2208 
SEQ22 0 9 
SEQ2210 
SEQ2211 

SEQ22 01 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206 
SEQ2207 
SEQ22 0 8 
SEQ2209 
SEQ2210 
SEQ2211 

SEQ22 01 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ2206 
SEQ2207 
SEQ22 0 8 
SEQ22 0 9 
SEQ2210 
SEQ2211 

SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ22 05 
SEQ2206 
SEQ22 07 
SEQ2208 
SEQ220 9 
SEQ2210 
SEQ2211 

SEQ22 01 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ220 6 
SEQ2207 
SEQ22 0 8 
SEQ2209 
SEQ2210 
SEQ2211 



AC^Gt^GAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAAmCCAGA 
AC2UVGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
ACTiAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
AC^GCAGAAAAGTTGGTAAAAACTTACTCT(^GCCTGC^GGTGCTAGTGAAC^CCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTC^GCCTGC^GGTGCTAGTGAAC^CCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGGAGGTGCTAGTGAACACCAGA 
ACAAGCAGAAAAGTTGGTAAAAACTTACTCTCAGCCTGCAGGTGCTAGTGAACACCAGA 

CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 
CTGGATTAGCGATGGATATGAGTACTGTAGATTCTTTGAATGAGAGCGATCCTAGAGTAG 

TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 
TCAGTCAGTTGAAAAAGATAGCTCCACAATATGGTTTTGTCTTACGGTTTCCGGATGGTA 

AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 
AAACAGCAGAAACAGGGGTAGGTTATGAAGATTGGCATTACCGCTATGTTGGGGTAGAGT 

CTGCAAAATATATGGCCAAACATC&TTTAAC^ 

CTGCAA^TATATGGCCGAACATCGTTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAAC^TC^TTTAAC^TTAGAAGAATAC^TAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGCCAAACATGATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGTCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGC^AAATATATGGCCAAACATCATTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCMAATATATGGCCAAAC^TCATTTAA(^TTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGTCAAACAT(^TTTAACATTAGAAGAATACATAACTTTATTAAAGG 
CTGCAAAATATATGGT CAAACATCATT TAACATTAGAAGAATACATAACTTTATTAAAGG 



583 



WO 2004/018646 



PCT/US2003/026827 



Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 



SEQ2201 
SEQ2202 
SEQ2203 
SEQ2204 
SEQ2205 
SEQ22 06 
SEQ2207 
SEQ220 8 
SEQ220 9 
SEQ2210 
SEQ2211 



AGAATAACCAA 

AGAATAA C CAA 

AGAATAACCAA 

AGAATAACCAAAACCCAGCTTTCTTGTACAA- 

AGAATAACCAA 

AGAATAACCAA 

AGAATAAC CAAAAC C CAG C T T T CT TGTACAA 

AGAATAACCAA 

AGAATAACCAA 

AGAATAAC CAAAAC CCAGCTTTCTT 

AGAATAACCAATAB CMARATVS TNCS RATNGTS AGDAANYDAAI^CARBX YTDASAMYRT 



>SEQ ID NO 2250: 18_090 frame: 1 

KPNSQQSSSQKiRNEDIKKISSQKOTKKLQLPAVSSKnDWNLILVNRDHKHEELSPDWPV 
ENIYIiDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNIjTRG 
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRPPDGK 
TAETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

>SEQ ID NO 2251: 18_1169NT frame: 1 

KPNSQQSSPQKLRNEDIKKISSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDWPV 
ENIYIiDKRITKQATQFLEAARAIDSREHIiISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK 
T AE TGV G Y ED WHYR YVGVE S AK YMAEHRL TL E E Y I TLL KENNQ 

>SEQ ID NO 2252: 18_18RS21 frame: 1 

KPNSQQ S S S QKLRNED I KKI S S QKRNKKLQLPAVS S KDWNL I LVNRDHKHE ELS PDWPV 
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
TAE TGV GYEDWHYR YVGVE S AKYMAKHHLTLEE YI TLLKENNQ 

>SEQ ID NO 2253: 18_2603 frame: 3 

SQQSSSQKiRNEDIKKISSQKRNKKLQLPAVSSKDWNLILVNRDHKHEELSPDWPVENI 
YLDKRI TKQATQFLEAARAIDSREHLI SGYRSVAYQEKLFNS YVTQEMTSNPNLTRGQAE 
KLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGKTAE 
TGVGYEDWHYRWGVE S AKYMAKHHLTLEE YI TLL KENNQNPAFLY 

>SEQ ID NO 2254: 18_A909 frame: 1 

KPNS QQ SSSQ KLRNED I KKTS S QKRNKKLRLPAVS S KDWNL I LVNRDH KHE EL S PDWPV 
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTKE 
OAEKLVKTYSQPAGASEHQTGIiAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
TAETGVGYEDWHYR YVGVE SAKYMAKHHLTLE EYI TLLKENNQ 

>SEQ ID NO 2255:18CJB110 frame: 1 

KPNSQQSSSQKLRNEDIKKISSQKRNKKFTITSCIIKRLELDFGQS 

>SEQ ID NO 2256:18_C0H1 frame: 1 

PNSQQSSSQKLRNEDIKKTSSQKRN 

>SEQ ID NO 2257: 18_H36B frame: 1 

KPNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDWPV 
ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTXEMTSNPNLTKE 
QAEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK 
TAETGV GYEDWHYRYVGVE SAKYMAKHHLTLE EYI TLLKENNQ 

>SEQ ID NO 2258: 18_JM9130013 frame: 1 

KPNSQQSSSQKLFJSfEDIKKISSQKPJSTKKLQLPAVSSK^ 

ENIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
QAEKLVKTYSQPAGASEHQTGIjAMDMSTVDSLNESDPRVVSQLKKIAPQYGFVLRFPDGK 
TAETGVGYEDWHYR YVGVESAKYMAKHHLTLEEYI TLLKENNQ 

>SEQ ID NO 2259:18_M732 frame: 3 

PNSQQSSSQKLRNEDIKKTSSQKRNKKLRLPAVSSKDWNLILVNRDHKHEELSPDWPVE 
NI YLDKRI TKQATQFLEAARAI DS REHL I S G YRS VAYQEKL FNS YVTQEMTSNPNLTRGQ 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGKT 
AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQNPAF 
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Table 22: Comparative Sequences relating to SAG0093 
(D-alanyl-D-alanine carboxypeptidase family protein) 

>SEQ ID NO 2260: 18_M781 frame: 1 

KPNSQQS S SQKLRNED I KKTS S QKRNKKLRLPAVS S KDWNLI LVNRDHKHEEL S PDWPV 
ENIYIiDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
QAEKLVKTYSQPAGASEHQTGLAMDMSTVBSLNESDPRWSQLKKIAPQYGFVIiRFPDGK 
TAETGV GYEDWHYRYVGVESAKYMVKHHLTLEE YI TLLKENNQ 



SEQ22 5 0 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ225 6 
SEQ2257 
SEQ2258 
SEQ225 9 
SEQ226 0 



PNSQQSSSQKLRNEDIKKISSQIOiNKKLQLPAVSSKDWNLILVNRDHKHEELSPDWP^ 
PNS QQS S PQKLRNEDI KKI S S QKRNKKLRL PAVS S KDWNL I LVNRDHKHEELS PDVVPV 
PNSQQSSSQKLRNEDIKKISSQKMKKLQLPAVSSKDWNLILVNRDHKHEELSPDWPV 
--SQQSSSQKLRNEDIKKISSQKRl^CLQLPAVSSKDWmiLVNRDHKHEELSPDVVPV 
PNSQQSSSQKljRNEDIKKTSSQKRNKKiRLPAVSSKDWNIjILVimDHKHEELSPDWPV 

PNSQQSSSQKLRNEDIKKISSQKRNKKFTITSCIIKRLEL DFGQS 

PNSQQSSSQKLRNEDIKKTSSQKRN 

PNSQQS S S QKLRNEDI KKTS S QKRNKKLRIjPAVS S KDWNL I LVNRDHKHEELS PDWPV 
PNS QQS SSQKIjRNEDI KKI SSQKRNKKLQLPAVSS KDWNL I LVNRDHKHEELS PDWPV 
PNSQQS SS QKLRNEDI KKTS S QKRNKKLRLPAVS S KDWNL I LVNRDHKHEELS PDWPV 
PNS QQS SS QKLRNEDI KKTS SQKRNKKLRL PAVS S KDWNLI LVNRDHKHEELS PDWPV 



SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ226 0 



NI YLDKRI TKQATQFLEAARAI DS REHL I SG YRS VAYQEKL FNS YVTQEMT SNPNLTRG 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
NIYLDK^ITKQATQFLFAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
NIYLDKRI TKQATQFLEAARAI DS REHL I SGYRSVAYQEKLFNSYVTQEMTSNPNLTKE 



NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTXEMTSNPNLTKE 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 
NIYLDKRITKQATQFLEAARAIDSREHLISGYRSVAYQEKLFNSYVTQEMTSNPNIiTRG 
NIYLDKRI TKQATQFLEAARAIDS REHL I SGYRSVAYQEKLFNSYVTQEMTSNPNLTRG 



SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ2260 



AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSIiNESDPRWSQLKI^IAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLK^IAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 



AEKX.VKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
AEKXVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLA>TOMSTVDSIaNESDPRWSQLKKIAPQYGFVLRFPDGK 
AEKLVKTYSQPAGASEHQTGLAMDMSTVDSLNESDPRWSQLKKIAPQYGFVLRFPDGK 



SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ22 60 

SEQ2250 
SEQ2251 
SEQ2252 
SEQ2253 
SEQ2254 
SEQ2255 
SEQ2256 
SEQ2257 
SEQ2258 
SEQ2259 
SEQ2260 



AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAEHRLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQNPAFLY 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYITLLKENNQ- 

AETGVGYEDWHYRYVGVESAKYMAKHHLTLEEYI TLLKENNQ 

AE TGVGYEDWHYRYVGVE S AKYMAKHHL TL EE Y I TL L KENNQ 

AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQNPAF 

AETGVGYEDWHYRYVGVESAKYMVKHHLTLEEYITLLKENNQTABLECMPARATIVESE 

ENCESRELATINGTSAGDALANYLDALANINECARBXYPEPTIDASEFAMILYPRTEIN 
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Table 23: Comparative Sequences relating to SAG0163 

(competence protein Cgl A) 

SEQ ID NO. 2301: SAG0163 FROM THE 090 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

GGC^GTAGAAGTAAATGCTCAAGATATTTATATCATTCCCA^ 

GTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAATTTGTGGCAG^ 

ACGAAGTCAATTAGGTTCTTGTGACTATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTG^ 

TCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGT 

ACTGGGTACAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGT^ 

TAAAAATAAGCAAATTATGACGATTGAAGATCCGGTAGAAATCAAGAATGAC^^ 

AATGACTTATGATGCTTTAATCAAACTGTCTTTACGGCATCGTCCAGATATTTT^ 

CCGTGCTGTTATTCGTGC^GTTTAACGGGAGTGATGGTTTTTTCTACTATTC^TGCTAAAAGTATTTCCGGAGTCTATGATAGGCT 
TATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCAACGTTTAA 
TGACTTTGAGACAGGTAACTTTAAAAAACACTCATCAGACAAGTGGAAT^ 
TAAGAAACAGGCACAAGTCGAAAAAATTATCCCTCAAGAAACAACGGA 

SEQ ID NO. 2302: SAG0163 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GGTGATTGTTATGAAACCTCTACTATTGCGTATTTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTC 
TTATTAGTCACTTTAAATTTGTGGCMGCATGAACGTTGGAGAAAAAAGACGAA 
AGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGA^^ 
ATC^GGACTTAAAATATTGGTTTGATAATATAAAGCAAATGAAGGAA 

TGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTt^GAAGTATTTAAAAATAAGC^^ 

AAATCAAGAATGACAAGATGTTACAACTCGAATTGAATGAGGATATTGGAATG^ 

ATCGTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCTCGTGCTGTT^ 

TT TT T T CTACTATT CAT GC TAAAAGTAT T C CC GGAGT CTATGATAGGC T TATAGAATTAGGGGT TAA CTAT CAAGAGT TAGAAAATA 
GTCTAAAATTAATAGCATATGAACGTTTAATTGGAGGAGGAAGCCTA^ 
ACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAAGGATATATCAGTAAGAAACAGG 
AAACAACGGAAAGTAGTCCAACTTTT 

SEQ ID NO. 2303: SAG0163 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE COMPLEMENT) 

GTTCAATCATTAGCAAAGCAAGTCATTCATC^ 

GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 

TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT^ 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTT^ 

TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCT^ 

ACTCTCATGTATCAATTAGCTT CAGAAGTATT TAAAAATAAGCAAATTATCACGATTG 

ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGCA 

ATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 

GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGC^ 

TAT CAACGTTTAAT TGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATTTTAAAAAACACTCATCAGACA^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGC^CAAGTCGAAAAAATTATCC 

CCAACTTTT 

SEQ ID NO. 2304: SAG0163 FROM THE 2603 V/R GBS TYPE V STRAIN (REVERSE COMPLEMENT) 

GATATTTATATCATTCCCAAAGGTGATTGTTATGAACTCTATATGCGTATTGaTGATGAAAGGCGGTTTATTGATGTTTTTGAGTTT 
AATAGGATGGCTAGTCTTATTAGTGA.CTTT^ 

GACTATGAACTGTC^GAGGGAAGACTGGTTTC^TTACGACTATCGAGTGTGGGAGATTATCGTGGTCMGAATCTTTAGTTATTCGT 
ATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGTTTGATAATATAA^ 

CTTTTTTCCGGCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGC^ 

ATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAACTCCAATTGAATGAGGATATTGGA^ 

AAACTGTCTTTACGGCATCGTCCAGATATTTTAATTATCGGAGAGATTA 

TTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTAT 

CAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCAACGTTTA^ 

AAAAAAC^CTCATCAGACAAGTGGAATAGACAAGTGGAT^ 

AAAAATTATCCCTCAAGAAACAACGGAAAGTAGTCCAACTTTT 
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Table 23: Comparative Sequences relating to SAGO 163 

(competence protein Cgl A) 

SEQ ID NO. 2305: SAGO 16 3 FROM THE A909 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

GTTCAATCATTAGC?AAAGC!AAGTCATTCATCAGG 

G^ACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 
TTTGTGGCAGGCATGAACGTTGGAGAA 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTTTGT^ 
TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATA^ 
ACT CTCATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTATC^ 
ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTT^ 

ATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 
GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACra^ 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATTTTAAAAAACACTCATCAG^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCG 

CCAACTTTT 

SEQ ID NO. 2306: SAGO 16 3 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE COMPLEMENT) 

GTTCAATCATTAGCAAAGCAAGTCATTCATC 

GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAA 
TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACTATGAACT 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTA 
TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGTGGT 
ACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTATC^ 

ATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGCATCGTCC^ 

ATTATCGGAGAGATTAGAGATC^GCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 

G CT AAAAGTAT TTCCGGAGT C TAT GATAGGC T TATAGAAT TAGGGGTTAAC TAT CAAGAGT TAGAAAATAGT C TAAAATTAATAGCA 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAACTTTA^ 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAAA^ 

CCAACTTTT 

SEQ ID NO. 2307: SAG0163 FROM THE COHl GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

AGGTGATTGTTATGAAATTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTA 

TTAGTCJACTTTAAATTTGTGGCAGGGATGAACGTTGGAGAAA 

GAAGACTGGTTTCATTACGACTATCA&GTGTGGGAGATTATCGT 

AGGACTTAAAATATTGGTTTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGC^GAGGGCTATATCTTTTTTCCGGCCCTGTGG 
GGAGTGGTAAAACAACTCTC^TGTATGAATTAGCTTCAGAAGTATTTAAA 

TCAAGAATGACAAGATGTTACAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTA 

GTCC^GATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTT 

TTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTm 

TAAAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGAC^ 

AGTGGAATAGACAAGTGGATATCTTGGCTGAAGAAGGACATAT(^GTAAGAAACA 

CAACGGAAAGT AGT C CAAC T TT T 

SEQ ID NO. 2308: SAG0163 FROM THE H36b GBS TYPE lb STRAIN (REVERSE COMPLEMENT) 

TCATTAGCAAAGCAAGTCATTCATCAGGCAGT^ 

TATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTAAATTTGTG 
GCAGGCATGAACGTTGGAGAAAAAAGACGAAGTC!AATTAGGTTCTTGTGACTATGAACTGTCAGA 
CTATCGAGTGTGGGAGATTATCGTGGTCAAGAATCTTTAGTTATTCGTATTTTGTATTC^ 
GATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTAT^ 

ATGTATCIAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTATCACGATTGAAGATCCGGTA 

CAACTCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTC 

GGAGAGAAATAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGGAAGTT^ 

AAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCA 

ACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATT^ 

TATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGC!^ 

TTTT 
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Table 23: Comparative Sequences relating to SAG0163 

(competence protein CglA) 

SEQ ID NO. 2309: SAG0163 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE COMPLEMENT) 

GTTCAATC^TTAGCAAAGCAAGTCATTC^ 

GAACTCTATATGCGTATTGATGATGAAAGGCGGTTTATTGATGTTTTTGAGTTTAATA 

TTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACTATGAACTG 

TTACGACTATCGAGTGTGGGAGATTATCGTGGTCTIAGAATCTTT^ 

TGGTTTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAG^ 

ACTCTC^TGTATCAATTAGCTTCAGAAGTATTTAAAAATAAGCAAATTAT^ 

ATGTTACAACTCCAATTGAATGAGGATATTGK3AATGACTTATGATGCTTTA^ 

ATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCAT 

GCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAG^ 

TATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGTAATTTTAAA 

GTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAA 

CCAACTTTT 

SEQ ID NO. 2310: SAG0163 FROM THE M732 GBS TYPE III STRAIN (REVERSE COMPLEMENT ) 

TGACTTGTTATGAAACTCTATATGCGTATTTGATGATGAAAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTA 

TTAGTCACTTTAAATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTG 

GAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTGGTC^ 

AGGACTTAAAATATTGGTTTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCGGCCCTGTGG 
GGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTA^ 

TCAAGAATGAGAAGATGTTACAACTCCAATTGAATGAGGATATT^ 

GTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCCGTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTT 

TTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTC 

TAAAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGTAACTTTAAAAAACACTCAT 

AGTGGAATAGACAAGTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAAC^^ 

CAACGGAAAGTAGTCCAACTTTT 

SEQ ID NO. 2311: SAGO 163 FROM THE M781 GBS TYPE III STRAIN (REVERSE COMPLEMENT) 

CAGTAGAAGTAAATGCTCAAGATATTTATATCATTCCCAAAGG 

TTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTC^CTTTAAATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGAC 

GAAGTCAATTAGGTTCTTGTGACTATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGG 

AAGAATCTTTAGTTATTCGTACTTTGTATTGAGGTCATCAGGACTTAAAAT^ 

TGTGTGCAAGAGGGCTATATCTTTTTTCCGGCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATT 
AAAATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAACTCCAATTGAAT 

TGACTTATGATGCTTTAATCAAACTGTCTTTACGGCATCGTCCAGATATTTTAATTATCGGAGAGATTAGAGATCAAGCGACGGCCC 

GTGCTGTTATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCCGGAGTCTATGATAGGCTTA 

TAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTAAAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTG 
ACTTTGAGACAAGTAACTTTAAAAAACACTCATCAGACAAGTGGAATA^ 

AGAAACAGGCAC^GTCGAAAAAATTATCCCTCAAGAAAC^CGGAAAGTAGTCCAACTTTT 



588 



WO 2004/018646 



PCT/US2003/026827 



Table 23: Comparative Sequences relating to SAG0163 

(competence protein CglA) 



SEQ2 301 GGCAGTAGAAGTAAATGCTCAAGATATT 

SEQ2302 

SEQ2 303 T TCAATCAT TAGCAAAG CAAGT CATT CAT CAGGCAGTAGAAGTAAATG CTCAAGATAT T 

SEQ2 3 0 4 GATATT 

SEQ2 305 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTAAATGCTCAAGATATT 

SEQ2 306 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAG 

SEQ2307 

SEQ2 308 TCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGAAGTAAATGCTCAAGATATT 

SEQ2 309 TTCAATCATTAGCAAAGCAAGTCATTCATCAGGCAGTAGA^ 

SEQ2310 

SEQ2 311 CAGTAGAAGTAAATGCTCAAGATATT 

SEQ2 301 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA 

SEQ23 02 GGTGA - TTGTTATGAA - ACCTCTACTATTGCGTATTTGATGATGA 

SEQ2303 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA 

SEQ2304 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA 

SEQ2 305 ATATCATTCCCAAAGGTGA- TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA 

SEQ2306 ATATCATTCCCAAAGGTGA - TTGTTATGAA - CTCTATA TGCGTATT - GATGATGA 

SEQ2 307 AGGTGA - TTGTTATGAAATTCTATA TGCGTATT - GATGATGA 

SEQ2 308 ATATCATTCCCAAAGGTGA- TTGTTATGAA- CTCTATA TGCGTATT- GATGATGA 

SEQ23 09 ATATCATTCCCAAAGGTGA -TTGTTATGAA -CTCTATA TGCGTATT- GATGATGA 

SEQ2 310 TGACTTGTTATGAAACTCTATA TGCGTATTTGATGATGA 

SEQ2 311 ATATCATTCCCAAAGGTGA - TTGTTATGAA - TTCTATA TGCGTATT- GATGATGA 

SEQ2 301 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 302 AA - GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGC TAGTCTTATTAGTCACTTTA 

SEQ23 0 3 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ23 04 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ23 0 5 AA-GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 306 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 307 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 308 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 309 AA - GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCT TATTAGTCACTTTA 

SEQ2 310 AAAGGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 311 AA- GGCGGTTTATTGATGTTTTTGAGTTTAATAGGATGGCTAGTCTTATTAGTCACTTTA 

SEQ2 301 AATTTGTGGC^GGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

S EQ2 302 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 303 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 304 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 305 AATTTGTGGC^GGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 306 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 307 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 308 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 309 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 

SEQ2 310 AAT T TGTGG CAGG CAT GAACGT T G GAGAAAAAAGA CGAAGT CAATTAGGT T CTTGTGAC T 

SEQ2 311 AATTTGTGGCAGGCATGAACGTTGGAGAAAAAAGACGAAGTCAATTAGGTTCTTGTGACT 
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Table 23: Comparative Sequences relating to SAG0163 

(competence protein CglA) 



SEQ2301 
SEQ23 02 
SEQ23 03 
SEQ2 304 
SEQ2305 
SEQ2306 
SEQ2307 
SEQ2308 
SEQ2309 
SEQ2310 
SEQ2311 

SEQ2301 
SEQ2 3 02 
SEQ2303 
SEQ2304 
SEQ23 05 
SEQ2306 
SEQ2307 
SEQ2308 
SEQ2309 
SEQ2310 
SEQ2311 



ATGAACTGTC^GAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCGAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG 
ATGAACTGTCAGAGGGAAGACTGGTTTCATTACGACTATCAAGTGTGGGAGATTATCGTG 

GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATC^GGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCT TTAGTTATTCGTATTT TGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTATTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 
GTCAAGAATCTTTAGTTATTCGTACTTTGTATTCAGGTCATCAGGACTTAAAATATTGGT 



SEQ2301 
SEQ2302 
SEQ2303 
SEQ2304 
SEQ230 5 
SEQ2306 
SEQ2307 
SEQ23 0 3 
SEQ2309 
SEQ2310 
SEQ2311 



T TGATAAT ATAAAGCAAATGAAGGAAGTAC TG GGTACAAGAGGG CTATAT CTTTTTTCCG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG 
T TGATAATATAAAG CAAATGAAGGAAGTAC TGGGTATAAGAGGG C TAT AT C TTTTTTCCG 

TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTACAAGAGGGCTATATCTTTTTTCCG 
TTGATAATATAAAGTAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATAT CTTTTTTCCG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGGGTATAAGAGGGCTATATCTTTTTTCCG 
T TGATAATAT AAAGTAAATGAAGGAAGTAC TGTGT G CAAGAGGGCTATAT CTTTTTTC CG 
TTGATAATATAAAGCAAATGAAGGAAGTACTGTGTGCAAGAGGGCTATATCTTTTTTCCG 



SEQ23 01 
SEQ23 02 
SEQ2303 
SEQ2304 
SEQ23 05 
SEQ23 0 6 
SEQ2307 
SEQ2308 
SEQ2309 
SEQ2310 
SEQ2311 



GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTGAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAAGAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
GCCCTGTGGGGAGTGGTAAAACAACTCTCATGTATCAATTAGCTTCAGAAGTATTTAAAA 
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Table 23: Comparative Sequences relating to SAG0163 

(competence protein Cgl A) 



SEQ2301 
SEQ2302 
SEQ2303 
SEQ23 04 
SEQ2305 
SEQ230 6 
SEQ2307 
SEQ230 8 
SEQ230 9 
SEQ2310 
SEQ2311 

SEQ23 01 
SEQ2302 
SEQ2303 
SEQ23 04 
SEQ23 05 
SEQ2306 
SEQ23 07 
SEQ23 0 8 
SEQ2309 
SEQ2310 
SEQ2311 

SEQ23 01 
SEQ23 02 
SEQ23 03 
SEQ23 04 
SEQ2305 
SEQ2306 
SEQ23 07 
SEQ23 0 8 
SEQ2309 
SEQ2310 
SEQ2311 



ATAAGCAAATTATCACGATTG^^GATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC 

ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC 

ATAAGCAAATTATC^CGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTA(^AC 

ATAAGCAAATTATC^CGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC 

ATAAGCAAATTATC^CGATTGAAGATCCGGTAGAAATCMGAATGAC^GATGTTACAAC 

ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC 

ATAAGCAAATTATCTlCGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTT^ 

ATAAGCAAATTATC^CGATTGAAGATCCGGTAGAAATC^GAATGACAAGATGTTAC^C 

ATAAG(^AATTATCACGATTGAAGATCCGGTAGAAATC^GAATGACAAGATGTTACAAC 

ATAAGCAAATTATCACGATTGAAGATCCGGTAG 

ATAAGCAAATTATCACGATTGAAGATCCGGTAGAAATCAAGAATGACAAGATGTTACAAC 

TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 
TCCAATTGAATGAGGATATTGGAATGACTTATGATGCTTTAATCAAACTGTCTTTACGGC 

AT CGT C CAGAT AT T T TAAT TAT CGGAG AGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
AT C GT C CAGATAT TT TAAT TAT CGGAGAGAT - TAGAGATCAAGCGACGGCTCGTGCTGTT 
ATCGTCCAGATATTTTAATTATCGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
AT CGTCCAGATATTTTAAT TAT CGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTC CAGATAT TT TAAT TAT CGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTC CAGATATTTTAATTAT CGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTCCAGATATTTTAATTATCGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTCCA.GATATTTTAATTATCGGAGAGAAA TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTCCAGATATTTTAATTATCGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTCCAGATATTTTAATTATCGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 
ATCGTCCAGATATTTTAATTATCGGAGAGAT - TAGAGATCAAGCGACGGCCCGTGCTGTT 



SEQ2301 
SEQ2302 
SEQ23 0 3 
SEQ2304 
SEQ2305 
SEQ2306 
SEQ2307 
SEQ23 0 8 
SEQ23 0 9 
SEQ2310 
SEQ2311 



ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTTCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTTCC 
ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGTTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTGATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 
ATTCGTGCAAGTTTAACGGGAGTAATGGTTTTTTCTACTATTCATGCTAAAAGTATTCCC 



SEQ2301 
SEQ23 02 
SEQ23 0 3 
SEQ23 04 
SEQ2305 
SEQ2306 
SEQ2307 
SEQ230 8 
SEQ2309 
SEQ2310 
SEQ2311 

SEQ23 01 
SEQ2302 
SEQ2303 
SEQ23 04 
SEQ23 0 5 
SEQ23 06 
SEQ23 07 



GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGT C TAT GATAGGC T TATAGAAT TAGGGGT TAA C TAT CAAGAGT TAGAAAATAGT CTA 
GGAGT C TATGATAG G C T T ATAG AAT TAGGGGT TAA CTAT CAAGAGT TAGAAAATAGT C T A 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 
GGAGTCTATGATAGGCTTATAGAATTAGGGGTTAACTATCAAGAGTTAGAAAATAGTCTA 

AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT 
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SEQ230 8 
SEQ230 9 
SEQ2310 
SEQ2311 

SEQ23 01 
SEQ2302 
SEQ2303 
SEQ2304 
SEQ23 0 5 
SEQ2306 
SEQ2307 
SEQ230 8 
SEQ2309 
SEQ2310 
SEQ2311 

SEQ2301 
SEQ23 02 
SEQ2303 
SEQ2304 
SEQ23 05 
SEQ2306 
SEQ2307 
SEQ230 8 
SEQ2309 
SEQ2310 
SEQ2311 

SEQ2301 
SEQ2302 
SEQ23 03 
SEQ23 04 
SEQ2305 
SEQ2306 
SEQ2307 
SEQ2308 
SEQ230 9 
SEQ2310 
SEQ2311 



AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAC3ACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAGGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT 
AAATTAATAGCATATCAACGTTTAATTGGAGGAGGAAGCCTAATTGACTTTGAGACAAGT 

AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AACTTTAAAAAAC^CTC^TCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AATTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 
AACTTTAAAAAACACTCATCAGACAAGTGGAATAGACAAGTGGATATCTTGGCTGAAGAA 

GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGATATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGC^ACAAGTGCGAAAA 

GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT- CGAAAAAATTATCCCTCAAGAAACAACGGA 
GGACATATCAGTAAGAAACAGGCACAAGT- CGAAAAAATTA 

GGACATATCAGTAAGAAACAGGCACAAGT - CGAAAAAATTATCCCTCAAGAAACAACGGA 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT , 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 

AAGTAGTCCAACTTTT 



>SEQ ID NO 2350:63_090 frame: 2 

AVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRS 
QLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDNIKQMKEVLGTR 
GLYI1FSGPVGSGKTTLMYQLASEVFKIJKQIITIEDPVEIKNDKMLQLQI1NEDIGMTYDAL 
IKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVYDRLIELGVNYQ 
ELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDIIiAEEGHISKKQAQVEKII 
PQETTESSPTF 

>SEQ ID NO 2351: 63_1169NT frame: 3 

.LL.NLYYCVFDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDYELSEGR 
LVSLRLSSVGDYRGQESIiVIRILYSGHQDIiKyWFDNIKQMKEVLGTRGLYLFSGPVGSGK 
TTLMYQLAS EVFKNKQ 1 1 T I EDP VE I KNDKMLQLQLNED I GMT YDALI KLS LRHRPDIL I 
IGEIRDQATARAVIRASIiTGVMVFSTIHAKSIPGVYDRIjIELGVNYQELENSLKLIAYQR 
LIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGYISKKQAQVEKIIPQETTESSPTF 

>SEQ ID NO 2352 :63 18RS21 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 
AGMNVGEKRRS QLGS CD YELS EGRLVSLRL S S VGD YRGQESLVI RI LYSGHQDLKYWFDN 
I KQMKE VLGI RGL YLFS GP VGS GKTTLMYQLAS EVFKNKQ 1 1 T I EDPVE I KNDKML QLQL 
NEDIGMTYDAL I KLSLRHRPDI LI IGEIRDQATARAVIRASLTGVMVFSTIHAKS I PGVY 
DRLIELGVtTYQELENSLKLIAYQRLIGGGSLIDFETGNFK^ 
SKKQAQVEKI I PQETTESSPTF 

>SEQ ID NO 2353: 63 2603 frame: 1 

DIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASIilSHFKFVAGMNVGEKRRSQLGSCDY 
ELSEGRLVSLRLSSVGDYRGQESIiVIRILYSGHQDLKYWFDNIKQMKEVLGIRGLYLFSG 
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PVGSGKTTLMYQLASEVFKNKQI I TIEDPVEI KNDKMLQLQLNEDIGMT YDALI KLSLRH 
RPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVYDRLIELGVNYQELENSLK 
L I A YQRL I GGGSL I DFETGNFKKHS SDKWNRQVD I LAEEGH I S KKQAQVRKNYP SRNNGK 
.SNF 

>SEQ ID NO 2354:63_A909 frame: 1 

VQSLAKQVIHQAVEVNAQDI YI I PKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 
AGMNVGEKRRSQLGSCDYELSEGRIjVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 
IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQIASEVFKNKQI ITIEDPVE I KNDKMLQLQL 
NEDIGMTYDALIKLSLRHRPDI LI IGE IRDQATARAVIRASLTGVMVFSTIHAKS I PGVY 
DRL I EliGVNYQELENSLKXIAYQRL I GGGSL I DFETGNFKKHS SDKWNRQVDILAEEGHI 
SKKQAQVEKII PQETTES S PTF 

>SEQ ID NO 2355: 63_CJB110 frame: 1 

VQSLAKQVIHQAVE VNAQD I YI I P KGD C YEL YMRIDDERRF IDVFEFNRMASLI SHFKFV 
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 
IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQI ITIEDPVEIKNDKMLQLQL 
NEDIGMTYDALI KLSLRHRPDILI IGE I RDQATARAVIRASLTGVMVFSTIHAKS I SGVY 
DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 
SKKQAQVEKI I PQETTESS PTF 

>SEQ ID NO 2356:63_CJB110 frame: 1 

VQSLAKQVIHQAVE VNAQD I Y 1 1 PKGDC YEL YMRI DDERRF IDVFE FNRMAS L I SHFKFV 

AGMNVGEKRRS QLGS CD YELS EGRLVSLRLS S VGD YRGQE SLVIRIL YS GHQDLKYWFDN 

I KQMKEVLGTRGL YLF S GPVGS GKTTLM YQLASEVFKNKQ 1 1 T I ED PVE I KNDKMLQLQL 

NEDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY 

DRLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SKKQAQVEKI I PQETTESS PTF 

>SEQ ID NO 2357: 63_H36B frame: 1 

SLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFVAG 
MNVGEKRRSQLGSCDYELSEGRLVSLRLSS VGD YRGQE SLVIRILYSGHQDLKYWFDNIK 
QMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQ I ITIEDPVE I KNDKMLQLQLNE 
DIGMTYDALIKLSLRHRPDILIIGEK 

>SEQ ID NO 2358 :63_JM913 0013 frame: 1 

VQSLAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 
AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 
IKQMKEVLGIRGLYLFSGPVGSGKTTLI^QLASEVFKNKQIITIEDPVEIKNDKMLQLQL 
NEDIGMTYDALI KLSLRHRPDILI IGE I RDQATARAVIRASLTGVMVFSTIHAKS I PGVY 
DRL I ELGVNYQELENS LKL I AYQRL I GGGS L I DFETGNFKKHS SDKWNRQVDI LAEEGH I 
SKKQAQVEKI I PQETTESS PTF 

>SEQ ID NO 2359:63_M732 frame: 3 

TCYETLYAYLMMKRRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCTJYELSEGRL 
VSLRLS S VGD YRGQES LVI RTL YS GHQDLKYWFDNI K . MKEVL CARGL YL FSGP VGSGKT 
TLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQLNEDIGMTYDALIKLSLRHRPDILII 
GE I RDQATARAVIRASLTGVMVFSTIHAKS I PGVYDRLIELGVNYQELENSLKLIAYQRL 
IGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHI SKKQAQVEKI I PQETTES S PTF 

>SEQ ID NO 2360:63JM781 frame: 3 

VEVNAQDI YI I PKGDCYEF YMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQ 
LGS CDYEL S EGRLVSLRLS S VGD YRGQE SLVI RTLYSGHQDLKYWFDNI KQMKEVL CARG 
L YL FSGP VGS GKTTLM YQLAS EVF KNKQ 1 1 TI EDP VE I KNDKMLQLQLNED IGMT YDAL I 
KLSLRHRPDIL I IGEI RDQATARAVIRASLTGVMVFSTIHAKS I PGVYDRLIELGVNYQE 
LENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHISKKQAQVEKIIP 
QETTESSPTF 

>SEQ ID NO 2361:63_C0H1 frame: 3 

VIVMKFYMRIDDERRFIDVFEFNRMASLISHFKFVAGMNVGEKRRSQLGSCDYELSEGRL 
VS LRLS S VGD YRGQESLVI RTL YS GHQDLKYWFDNI K 
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SEQ2350 AVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 

SKQ2 351 ■ LLNLYYCVFDDERRFIDVFEFNRMASLISHFKFV 

SEQ2352 QSLAKQVIHQAVEWAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 

SEQ2 353 D I YI I PKGDC YEL YMRI DDERRF I DVFEFNRMAS L I S HFKFV 

SEQ23 54 QSLAKQVIHQAVEVNAQD I YI I PKGDC YEL YMRI DDERRFI DVFEFNRMAS LI SHFKFV 

SEQ2355 QSIjAKQVIHQAVEVNAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 

SEQ23 56 QSLAKQVIHQAVEVNAQDI YI I PKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 

SEQ2357 -SliAKQVIHQAVEWAQDIYIIPKGDCYELYMRIDDERRFIDVFEFNRMASLISHFKFV 

SEQ2 358 QS LAKQVI HQAVEVNAQDI YI I PKGDC YEIi YMRI DDERRF IDVFEFNRMAS L I S HFKFV 

SEQ2359 TC YETL YA YLMMKRRF I DVFE FNRMAS L I SHFKFV 

SEQ2 360 VEVNAQDI YI I PKGDCYEFYMRIDDERRFIDVFEFNRMASLI SHFKFV 

SEQ2361 VI VMKF YMRIDDERRFIDVFEFNRMASLI SHFKFV 

SEQ2 350 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 

SEQ2 351 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGD YRGQESLVI RILYSGHQDLKYWFDN 

SEQ2 352 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 

SEQ2 353 AGMNVGEKRRS QLGS CDYELSEGRLVS LRLS S VGD YRGQESLVI R I L YS GHQDLKYWFDN 

SEQ2 354 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSS VGD YRGQESLVIRIL YSGHQDLKYWFDN 

SEQ2 355 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGD YRGQESLVI RI LYSGHQDLKYWFDN 

SEQ2 356 AGMNVGEKRRS QLGS CD YELSEGRLVSLRLSS VGD YRGQESLVI RI LYSGHQDLKYWFDN 

SEQ2 357 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRILYSGHQDLKYWFDN 

SEQ2 358 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRI LYSGHQDLKYWFDN 

SEQ2 359 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSSVGDYRGQESLVIRTLYSGHQDLKYWFDN 

SEQ2 360 AGMNVGEKRRS QLGS CD YELS EGRLVS LRLS S VGD YRGQE SLVIRTL YSGHQDLKYWFDN 

SEQ2361 AGMNVGEKRRSQLGSCDYELSEGRLVSLRLSS VGD YRGQESLVIRTL YSGHQDLKYWFDN 

SEQ2350 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 

SEQ2351 IKQMKEVLGTRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 

SEQ2352 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 

SEQ2353 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 

SEQ2 354 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDKMLQLQL 

SEQ2 355 I KQMKEVLGTRGL YLFS GPVGS GKTTLM YQLASEVFKNKQ I ITIEDPVEIKNDKMLQLQL 

SEQ2 356 I KQMKEVLGTRGL YLFS GPVGS GKTTLMYQLAS EVFKNKQ I ITIEDPVEIKNDKMLQLQL 

SEQ2357 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQI ITIEDPVEIKNDKMLQLQL 

SEQ2 358 IKQMKEVLGIRGLYLFSGPVGSGKTTLMYQLASEVFKNKQI ITIEDPVEIKNDKMLQLQL 

SEQ2 359 IK- MKEVLCARGLYLFSGPVGSGKTTLMYQLASEVFKNKQI ITIEDPVEIKNDKMLQLQL 

SEQ23 6 0 IKQMKEVLCARGLYLFSGPVGSGKTTLMYQLASEVFKNKQIITIEDPVEIKNDI^ILQLQL 

SEQ2361 IK 

SEQ2350 ED I GMT YDAL I KL S LRHRPD I L 1 1 GEI RDQATARAVI RASLTGVMVFS TIHAKS I S GVY 

SEQ2351 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2352 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2353 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2354 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2355 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSISGVY 

SEQ2356 ED I GMT YDAL I KLS LRHRPDI L 1 1 GE I RDQATARAVI RAS LTGVMVFS TIHAKS I S GVY 

SEQ23 57 ED I GMT YDAL I KL S LRHRPDI L 1 1 GEK 

SEQ2358 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2359 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2360 EDIGMTYDALIKLSLRHRPDILIIGEIRDQATARAVIRASLTGVMVFSTIHAKSIPGVY 

SEQ2361 

SEQ23 50 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2351 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKIWRQVDILAEEGYI 

SEQ2352 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2353 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2354 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2355 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2356 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ2357 

SEQ2358 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETGNFKKHSSDKWNRQVDILAEEGHI 

SEQ23 59 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHI 

SEQ23 60 RLIELGVNYQELENSLKLIAYQRLIGGGSLIDFETSNFKKHSSDKWNRQVDILAEEGHI 

SEQ2361 
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SEQ2350 
SEQ2351 
SEQ2352 
SEQ2353 
SEQ2354 
SEQ2355 
SEQ235 6 
SEQ2357 
SEQ2358 
SEQ2359 
SEQ2360 
SEQ2361 



KKQAQVEKI IPQETTESSPTF 
KKQAQVEKI I PQETTES SPTF 
KKQAQVEKI IPQET TESSPTF 
KKQAQVRKNYPSRNNGKSNF- 
KKQAQVEKI IPQETTESSPTF" 
KKQAQVEKI IPQETTESSPTF 
KKQAQVEKI IPQETTESSPTF 

KKQAQVEKI I PQETTES SPTF 
KKQAQVEKIIPQETTESSPTF 
KKQAQVEKI IPQETTESSPTF 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 

SEQ ID NO. 2401s SAG0290 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTGAGAGAAAGTAGAACTTAAAGTAGCTACAGAT^ 
TCAAAAAGACGGGAAATTCAAAGGTTAT 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTT CAACAGGTATTGATGCAGGGAAATTTGATTTAT CA 

GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGACCCTATATCCCGTTCAAA 

TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAG 

TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCA^ 

AAAATCAAATATGTTTCTGGK5ACAACTGGTGTTA 

CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2402: SAG0290 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TRAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTGATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGC CGTAGTAGGGAAGAAGGGGAGCCATTACAAAT CAT TAAGTGACCT CTCTGGAAAATCAACCGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAC^ATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2403s SAG0290 FROM THE 2603 V/R GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

ATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGA 
CAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCA 
TACAATAAAGAAAGAGCAGAAAAATAT CTCTT CTCAGAT C C TAT AT C C CGTTCAAATTATGC CGTAGTAGG 
GAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTATCTGGCGTTA 
ACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACC^^ 

TCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGACTTTATCCTATATGA 
TGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAA 
TTGGTAATAATAAGGATGGACTAGAATACCT C CTTTTACCAAAAGATAAAAAAG 

SEQ ID NO. 2404: SAG0290 FROM THE 090 GBS TYPE la STRAIN (REVERSE COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCA.TTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTCCT 
TTGAAAGGTAAAATTGGT AAT AAT AAGGATGGACTAGAATAC CT C CT TTTAC CAAAAGAT AAAAAAGGTAAAACTCTA 
CAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATATTTCGGT 
GGAGATTAC GTT TC AAAC ATTGAT AAA 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 

SEQ ID NO. 2405$ SAG0290 FROM THE A909 GBS TYPE la STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTGAAAGGTTATGATGTTGA 

AAGTAACCTTC^^GACAGTTCCTTTTGATACTATTTC^CAGGTATTGATGC^GGGAAATTTGATTTAT^ 
GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAi^ 

TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCA 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAA^^ 

AAAATNAAATATGTTTCTGGGAGAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCT CCTTTTAC CAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGT 

SEQ ID NO. 2406: SAG0290 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 

TCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 

GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 

TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATGATTAAGTGACCTCTCTGGAAAA 

TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAAT^ 

AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2407: SAG0290 FROM THE COH1 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCACCATTTACTTA 
TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCTGTTTTTAAAGGTAGTAAGTACA 
AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAAGAGGTATTGATGCAGGGAAATTTGATTTATC^ 
GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGC CGTAGTAGGGAAGAAGGGGAGC CATTACAAAT CATTAAGTGACCTCTCTGGAAAAT CAACAGAAG 
TTTTATCTGGCGTTAACTATGCAGAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGC^ACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGAAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATAC CT CCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2408: SAG0290 FROM THE H36b GBS TYPE lb STRAIN 
(REVERSE COMPLEMENT) 

GTAT CAGTT CAGGCGT CAGAGAAAGTAGAACTTAAAGTAG CTACAGATT CTGACACGGCAC CATTTACTTA 
TG&AAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTT^^ 

AAGTAACCTTC^GA(^GTTCCTTTTGATACTATTTCAACAGGTATTGATG(^GGGAAATTTGATTTATCA 
GCTAATGATTTTTCATACAATAAAGAAAGAG(^GAAA^TATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCA.TTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGC^ACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAAGCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAG 
TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 



SKQ ID NO. 2409: SAG0290 FROM THE JM9130013 GBS STRAIN VIII (REVERSE 
COMPLEMENT) 

GTATC^GTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGC^CCATTTACTTA 

TCAAAAAGACGGGAAATTC^AAGGTTATGATGTTGATGT^ 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATCT 

GCTAATGATTTTTCATACAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 
TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAA.TCATTAAGTGACCTCTCTGGAAAATCAACCGAAG 
TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACCAATA 
AAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGGAAAATTGA 
CTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAAGACCAATCATTAAACTTAA.GCGTTTCTC 
CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 
AAAACTCTACAGAAATTTATAAATAAGCGTAATAAAGTTTTGAAAGAAAATGGTA 

SEQ ID NO. 2410: SAG0290 FROM THE M732 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GTATGAGTTG&GGCGTC^ 

TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTGAAAGCTGTTTTT^ 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 

GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 

TTATGCCGTAGTAGGGAAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAG 

TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAACC^ 

AAAATCAAATATGTTTCTGGGAG^CTGGTGTTACTAGCAGATTAAAAAA.TATTGAGAGTGGAAAAATTGA 

CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 

CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 

AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 

TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ ID NO. 2411: SAG0290 FROM THE M781 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT ) 

GTATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAX*^ 

TCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCTGTTTTTAA^ 

AAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCAACAGGTATTGATGCAGGGAAATTTGATTTATCA 

GCTAATGATTTTTCATATAATAAAGAAAGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAA 

TT ATGC CGTAGTAGGGAAGAAGGGGAGC CATTACAAAT CATTAAGTGACCTCTCTGGAAAAT CAACAGAAG 

TTTTATCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATC^TCCTAATAAAAAACCAATA 

AA^TCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATTGAGAGTGGAAAAATTG^ 

CTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAAGATCAATCATTAAACTTAAGCGTTTCTC 

CTTTGAAAGGTAAAATTGGTAATAATAAGGATGGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGT 

AAAACTCTACAGAAATTTATAAATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAG 

TAAACAATATTTCGGTGGAGATTACGTTTCAAACATTGATAAA 

SEQ2 401 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
SEQ24 0 2 TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 



SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ2410 
SEQ2411 



TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTT CAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATT CTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 
TATCAGTTCAGGCGTCAGAGAAAGTAGAACTTAAAGTAGCTACAGATTCTGACACGGCA 



SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 



CATTTACTTAT CAAAAAGACGGGAAAT TCAAAGGT TATGATGTTGATGTTGTCAAAGCT 
CATTTACTTATRAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT 



ATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT 



CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT 
CATTTACTTAT CAAAAAGACGGGAAAT TCAAAGGT TATGATGTTGATGTTGTCAAAGCT 
CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTT(^TGTTGTCAAAGCT 
CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCT 
CATTTACTTATCAAAAAGACGGGAAATTC^AAGGTTATGATGTTGATGTTGTCAAAGCT 
CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGATGTTGATGTTGTCAAAGCT 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 



SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ240 6 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ24X0 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ240 9 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ241Q 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ240 8 
SEQ2409 



CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTayVAGCT 
CATTTACTTATCAAAAAGACGGGAAATTCAAAGGTTATGACGTTGATGTTGTCAAAGCT 

GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTT CAAGACAGTTCCTTTTGATACTAT TTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 
GTTTTTAAAGGTAGTAAGTACAAAGTAACCTTCAAGACAGTTCCTTTTGATACTATTTCA 

ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACA.GGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGC^GGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATACAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA 
ACAGGTATTGATGCAGGGAAATTTGATTTATCAGCTAATGATTTTTCATATAATAAAGAA 

AGAGCAGAAAAATATCTCTTCTCAGAC CCTATATCCCGTT CAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCT TCT CAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATC CCGTTCAAATTATGC CGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCT TCT CAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCT CAGATCCTATATC CCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
AGAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 
ACtAGCAGAAAAATATCTCTTCTCAGATCCTATATCCCGTTCAAATTATGCCGTAGTAGGG 

AAGAAGGGGAGCCATTACAAATCATTAAGTGACCT CTCTGGAAAAT CAACAGAAGTTTTA 
AAGAAGGGGAGCC^TTACAAATCATTAAGTGACCTCTCTGGAAAATC^CCGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCT CTCTGGAAAATCAACCGAAGT TTTA 
/LAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACAGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCTCTCTGGAAAATCAACCGAAGTTTTA 
AAGAAGGGGAGCC^TTAOU^TC^TTAAGTGACCTCTCTGGAAAATC^CCGAAGTTTTA 
AAGAAGGGGAGCCATTAOVAATCATTAAGTGACCTCTCTGGAAAATC^CAGAAGTTTTA 
AAGAAGGGGAGCCATTACAAATCATTAAGTGACCT CTCTGGAAAATCAACAGAAGTT TTA 

TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGC^CAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATNNTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGT TAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCAT CCTAATAAAAAA 
TCTGGCGTTAACTATGCACAGGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 
TCTGGCGTTAACTATGC^CZAGGTTCTAGAAAATTGGAATAAAAATC!ATCCTAATAAAAAA 
TCTGGCGTTAACTATGCAC^GGTTCTAGAAAATTGGAATAAAAATCATCCTAATAAAAAA 

CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCAATAAAAAT CAAATATGTTT CTGGGACAACTGGTGT TACTAGCAGATTAAAAAATAT T 
C CAATAAAAAT CAAATATGTTT CTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCANTAAAAATNAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCAATAAAAAT CAAATATGTTT CTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
COVATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCAATAAAAATCAAATATGTTTCTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 
CCAATAAAAATCAAATATGTTTCTGO ACTGGTGTTACTAGCAGATTAAAAAATATT 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 



SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ24 03 
SEQ2404 
SEQ2405 
SEQ2406 
SEQ2407 
SEQ240 8 
SEQ24 09 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ240 6 
SEQ2407 
SEQ240 8 
SEQ24 0 9 
SEQ2410 
SEQ2411 

SEQ240X 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ240 6 
SEQ2407 
SEQ240 8 
SEQ240 9 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ2405 
SEQ240 6 
SEQ2407 
SEQ240 8 
SEQ240 9 
SEQ2410 
SEQ2411 

SEQ2401 
SEQ2402 
SEQ2403 
SEQ2404 
SEQ24 05 
SEQ2406 
SEQ2407 
SEQ2408 
SEQ2409 
SEQ2410 
SEQ2411 



CC^TAA?U^TC^ATATGTTTCTGGGAC^CTGGTGTTACTAGCAGATTAAAAAATATT 
C CAATAAAAATCAAATATGXTT CTGGGACAACTGGTGTTACTAGCAGATTAAAAAATATT 

GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGC CATTT CATCCGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGAAAAATTGACTTTATCCTATATGATGCCATTTCATCTGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGGAAAATTGACTTTATCCTATATGATGCCATTTCATCCGACTATATTGTAAAA 
GAGAGTGGAAAAAT TGACTTTATCCTATATGATGCCATTT CATCTGACTATATTGTAAAA 
GAG AGTGGAAAAAT TGAC T T TAT C CTATATGAT GC CAT TT CAT C TGAC TATAT T GT AAAA 

GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCA.TTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GAT CAATCATTAAACTTAAGCGTTTCT C CTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GACCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 
GATCAATCATTAAACTTAAGCGTTTCTCCTTTGAAAGGTAAAATTGGTAATAATAAGGAT 

GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 

GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAG 

GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGACTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 
GGATTAGAATACCTCCTTTTACCAAAAGATAAAAAAGGTAAAACTCTACAGAAATTTATA 

AT AAG C GTAT TAAAGT T TT GAAAGAAGATGGT AC T TTGG CACGTTTAAGTAAACAATAT 
ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT 

ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT 

ATAAGCGT 

ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACAATAT 
ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT 
ATAAGCGTATTAAAGTTTTGAAAGAAAATGGTACTTTGGCACGTTTAAGTAAACJAATAT 

ATAAGCGTAATAAAGTTTTGAAAGAAAATGGTA 

ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT 
ATAAGCGTATTAAAGTTTTGAAAGAAGATGGTACTTTGGCACGTTTAAGTAAACAATAT 

TCGGTGGAGATTACGTTTCAAACATTGATAAA 

TCGGTGGAGATTACGTTT CAAACATTGATAAA 

TCGGTGGAGATTACGTTTCAAACATTGATAAA 

TCGGTGGAGATTACGTTTCAAACATTGATAAA 

T CGGTGGAGAT TACGTTTCAAACATTGATAAA 

TCGGTGGAGATTACGTTTCAAACATTGATAAA 

TCGGTGGAGATTACGTTTCAAACATTGATAAA 

T CGGTGGAGAT TACGTTTO\AAC^TTGATAAAGTRCNmRATVSTNCSRATNGTSAGABC 



600 



WO 2004/018646 



Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 

SEQ2401 

SEQ2402 

SEQ2403 

SEQ2404 

SEQ2405 

SEQ2406 

SEQ2407 

SEQ2408 

SEQ2409 

SEQ2410 

SEQ2 411 RANSRTRSTBS TRATBNDNGRTN 

>SEQ ID NO 2450: 8_JL169NT frame: 1 

VSVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVT3WKAVFKGSKYKVTFKTVPFDTIS 
TGIDAGKFDLSANDFS YNKERAEKYLFSDP I SRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPI KI KYVSGTTGVTSRLKNI ESGKIDFILYDAISSDYIVK 
DQS LNLS VS PLKGKIGNNKDGL E YLLLPKDKKGKTLQKF I NKRI KVLKEDGTIiARL SKQ Y 
FGGDYVSNIDK 

>SEQ ID NO 2451:8_18RS21 frame: 1 

VSVQAS EKVELKVATDSDTAP FT YXKDGKFKG YDVDWKAVFKGS KYKVTFKTVP FDTI S 
TGIDAGKFDLSANDFS YNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPI KI KYVSGTTGVTSRLKNIESGKIDF ILYDAI S SDYI VK 
DQSLNLSVS PLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRI KVLKENGTLARLSKQY 
FGGDYVSNIDK 

>SEQ ID NO 2452:8_2603 f rame : 2 

FKGYDVDWKAVFKGSKYKVTFKTVPFDTISTGIDAGKFDLSANDFSYNKERAEKYLFSD 
PISRSNYAWGKKGSHYKSLSDLSGKSTETVLSGVNYAQVLFJSW^ KYVSG 
TTGVTSRLKNIESGKIDFILYDAISSDYIVKDQSLNLSVSPLKGKIGNNKDGLEYLLLPK 
DKK 

>SEQ ID NO 2453:8_090 frame: 1 

VSVQAS EKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS 
TGIDAGKFDLSANDFS YNKERAEKYLFSDP I SRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIK^^ 
FGGDYVSNIDK 

>SEQ ID NO 2454:8_A909 frame: 1 

VSVQAS EKVELKVATDSDTAP FT YQKDGKFKGYDVDVVKAVFKGS KYKVTFKTVP FDT I S 
TGIDAGKFDLSANDFS YNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHXNKKPXKXKYVSGTTGVTSRLKNI ESGKIDFI LYDAI S SDYIVK 
DQ S LNL SVS PL KGK I GNNKDGLE YLLLPKDKKGKTLQKF I NKR 

>SEQ ID NO 2455: 8_CJB110 frame: 1 

VSVQAS EKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKVTFKTVPFDTI S 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPI KIKWSGTTGVTSRLKNI ESGKIDFILYDAISSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTIARLSKQY 
FGGDYVSNIDK 

>SEQ ID NO 2456: 8_COHl frame: 1 

VSVQAS EKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKVTFKTVPFDTI S 
TGIDAGKFDLSANDFS YNKERAEKYLFSDPI SRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY 
FGGDYVSNIDK 

>SEQ ID NO 2457:8_H36B frame: 1 

VSVQAS EKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKWFKTVPFDTIS 
TGIDAGKFDLSANDFS YNKERA.EKYLFSDPISRSNYAVVGKKGSHYKSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKP I KIKYVSGTTGVTSRLKNI ESGKIDFILYDAISSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 
FGGDYVSNIDK 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 



>SEQ ID NO 2458 : 8_JM913 0013 frame: X 

VSVQAS EKVELKVATDSDTAPFT YQKDGKF KG YDVD WKAVFKGS KYKVTF KTVPFDT I S 
TGIDAGKFDLSAlTOFSYNKERAEKYIiFSDPISRSNYAWGKKGSHYKSIiSDLSGKSTFArL 

SGVNYAQVLFJJWNKNHPNKKPIKIKYVSGTTGWSFJ^^ 
DQSIjNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRNKVIjKENG 

>SEQ ID NO 2459:8_M732 frame: 1 

VSVQAS EKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKVTFKTVPFDTI s 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAWGKKGSHYICSLSDLSGKSTEVL 
SGVNYAQVLENWNKNHPNKKPI KIKYVSGTTGVTSRIiKNI ESGKIDFI LYDAI SSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLK^^ 
FGGDYVSNIDK 

>SEQ ID NO 2460:8_M781 frame: 1 

VSVQAS EKVEIjKVATDSDTAPFTYQKDGKFKGYDTOWKAVFKGS KYKVTF KTVPFDT IS 
TGIDAGKFDLSANDFS YNKERAEKYLFSDP I S RSNYAWGKKGSH YKSLSDLS GKS TEVL 
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTIiARLSKQY 
FGGDYVSNIDK 
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SEQ2457 
SEQ2458 
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SVQASEK^ELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS 
SVQASEKVELKVATDSDTAPFTYXKDGKFKGYDVXIWKAVFKGSKYKVTFKTVPFDTIS 

FKGYDVDWKAVFKGSKYKVTFKTVPFDTIS 

SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS 
S VQAS E KVE L KVATD S DTAP F T YQKDGKF KG YD VD WKAVF KG S KY KVTFKTVP FD T I S 
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS 
SVQASEKVEDKVATDSDTAPFTYQKDGKFKGYDVDVVKAVFKGSKYKVTFKTVPFDTIS 
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKVTFKTVPFDTIS 
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDVVKA.VFKGSKYKVTFKTVPFDTIS 
S VQAS EKVELKVATDSDTAPFT YQKDGKFKGYDVD WKAVFKGS KYKVTFKTVP FDTIS 
SVQASEKVELKVATDSDTAPFTYQKDGKFKGYDVDWKAVFKGSKYKVTFKTVPFDTIS 

TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAWGKKGSHYKSLSDLSGKSTEVL 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAWGKKGSHYKSLSDLSGKSTEVL 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAWGKKGSHYKSLSDLSGKSTEVL 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAWGKKGSHYKSLSDLSGKSTEVL 
TGIDAGKFDLSANDFSYlNfKERAEKYLFSDPISRSNYAWGKKGSHYKSLSDLSGKSTEVL 
TGIDAGKFDLSANDFSYNKERAEKYLFSDPISRSNYAVVGKKGSHYKSIjSDIjSGKSTEVL 

tgidagkfdlsandfs ynkeraekylfsdp i srsnyawgkkgshykslsdlsgkstevl 
tgidagkfdlsandfsynkeraekylfsdpisrsnyawgkkgshykslsdlsgkstevl 
tgidagkfdlsandfsynkeraekylfsdpisrsnyawgkkgshykslsdlsgkstevl 
tgidagkfdlsandfsynkeraekylfsdpisrsnyavvgkkgshykslsdlsgkstevl 
tgidagkfdlsandfsymeraekylfsdpisrsnyawgkkgshykslsdlsgkstevl 

sgvnyaqvlf2jwnknhpnkkp i ki kyvsgttgvts rlkni esgkidf ilydai s sdyi vk 
sgvnyaqvlenwnknhpnkkpi kikyvsgttgvtsrlkniesgkidfilydai ssdyivk 

SGVNYAQVLENWNKNHPNKKP I KIKYVSGTTGVTSRLKNI ESGKIDF I LYDAI SSDYIVK 
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
SGVNYAQVLENWNKNHXNKKPXKXKWSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
SGVNYAQVLENWNKNHPNKKP I KIKYVSGTTGVTSRLKNIESGKIDFILYDAI SSDYIVK 
SGVNYAQVLENWNKNHPNKKP I KIKYVSGTTGVTSRLKNI ESGKIDFILYDAI SSDYIVK 
SGVNYAQVLENWNKNHPNKKPIKIKYVSGTTGVTSRLKNIESGKIDFILYDAISSDYIVK 
SGVNYAQVLENWNKNHPNKKPI KIKYVSGTTGVTSRLKNI ESGKIDFILYDAISSDYIVK 
SGVNYAQVLENWNKNHPNKKP I KIKYVSGTTGVTSRLKNI ESGKIDF ILYDAISSDYIVK 
SGVNYAQVLENWNKNHPNKKPIKI KYVSGTTGVTS RLKNI ESGKIDFI LYDAI S SDYI VK 

DQSLNLS VS PLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRI KVLKEDGTLARLSKQY 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKK 

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 

DQSLNLS VSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKR 

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKF I NKR I KVLKEDGTLARLSKQY 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKENGTLARLSKQY 

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRNKVLKENG- 

DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKRIKVLKEDGTLARLSKQY 
DQSLNLSVSPLKGKIGNNKDGLEYLLLPKDKKGKTLQKFINKR I KVLKEDGTLARLSKQY 
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Figure 24: Comparative Sequences relating to SAG0290 
(ABC transporter, substrate-binding protein) 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

SEQ ID NO. 2501: SAG0368 FROM THE 090 GBS TYPE la STRAIN 

TATAATTTTTCGAC TAATGAATTGTC TAAGACT TTTAAAGATTTTAAGC TAGCTAAAT CAAAAAGTC ATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGAT C TTAGTC ACTATAAATCCTAAAACTAATAAAAC AACGATGACAAGC TTAGAACGTGAC GTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGG CATTGATGACTGTT C AAGAC TTATTAGATATTAATGTTGATTAC TTTATGCAAATTAATATG C AAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG 
CGCTATGATGAT C C AGAGGGAGATTATGGGCGT CAAAAAAGAC AACGTGAAGTAATTCAAAAAGTC CTT AAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTAT C AGATGGTGGCTCTTAT CAAATTTTAACTAAGAAACATC TAG TTGCAGTT CAAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT 
AGTAATGATTCT T CTACTTATTC ATC AAC AC AAGAGAATAATTATAATAC AAC ACCTTATTC AGAAGC AC C AC C AAGT 
TAC AGTGGTAATAC TACTTAT AGTTCTG AGAC T AATCAAAC AACT C ATCAAAATTACTATAATAGTAGCACT C CTGCT 
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG 
CCTAATCCA 

SEQ ID NO. 2502: SAG0368 FROM THE 1169NT1 GBS TYPE V STRAIN 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTTGGTCAGGAAA 
TAGCGATT CTATGATCTTAGTC AC TATAAAT CC TAAAACTAATAAAACAACGATGACAAGC T TAGAACGTGACGTATT 
GATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGCGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGG 
TGCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGG 
ATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAA 
TGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTAT 
GCGCTATGATGAT C CAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTC C TTAAAAAAAT 
ATTGGCGTTAAATAGTATT AGTT C ATACAAAAAAATTCTTT C CGCAGTAAGTAATAACATGCAAACTAATATTGAGAT 
ATC ATCAAAAAC GATTC CTAATTTGTTAGCTTATAAAGATT CATTGGAACATATTAAATCTTATCAGT TGAAAGGTGA 
AGACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAA 
GAAAGAACTAGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGC 
TAGTAATGATTCTTCTACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAG 
TTACAGTGGTAATACTACTTATAGTTCTGAGACTAATCAAACAACTCATCAAAGTTACTATAATAGTAGCACTCCTGC 
TAATAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAATGGGGCTGCAAC 
GCCTAATCCA 

SEQ ID NO. 2503 SAG0368 FROM THE 18RS21 GBS TYPE II STRAIN 

TATAATTTTTCGAC TAATGAATTGTCTAAGAC TTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGC TATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATC TTAGT CACTATAAATCC TAAAACTAATAAAACAACGATGAC AAGC TTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTAC AAGGCTGTTGTTGAAC C AGGGACAGATAAAATAAATGGAGAACAAGCACTTGT TTATTCT CGTATG 
CGCTATGATGAT C C AGAGGGAGAT TATGGGCGT C AAAAAAGACAACGTGAAGTAATTCAAAAAGTC CTTAAAAAAATA 
TTGGCGTTAAAT AGTATTAGTT C ATACAAAAAAATT CTTT C CGC AGTAAGTAATAAC ATGC AAAC TAATATTGAGATA 
T CAT C AAAAACGAT T C C T AAT T TG T T AG C T T AT AAAG AT T C AT TG GAA CAT AT T AAAT C T TAT C AGT TG AAG GGTG AA 
GACGC TACTT TAT C AGATGGTGGC T CTTATC AAATTTTAACTAAGAAAC AT CT ACT TGC AGT TC AAAATAGAATTAAG 
AAAGAAC TGGATAAAAAGCGT AGT AAAAC TC TGAAG AC AAGCGCGATT CTATATGAAG ATTAC TATGGTACTACTGCT 
AGTAATGATTC TTCTACT TATT C ATC AAC ACAAGAGAATAATTATAAT ACAAC AC C TTATTCAGAAGCACCAC C AAGT 
TACAGTGGTAATACTACTTATAGT TC TGAGAC TAAT C AAAC AACT CATCAAAATTACTAT AAT AGT AGC ACT CCTGC T 
AGTAACTATAGC AGTAACACTAAC ACAGGTC AGGCTGATTCAAGTGGAAGTGT CAATAAT C ATAACGGGGCTG C AACG 

CCTAATCCA 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

SEQ ID NO. 2504: SAG0368 FROM THE 2603 V/R GBS TYPE V STRAIN 

TATAATTTTTCGAC TAATGAATTGT CTAAGAC TTT T AAAGATTTTAAGCTAGCTAAATCAAAAAGTC ATGCT ATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACC AGAGTAC AAGGCTGTTGTTGAAC CAGGGAC ACATAAAATAAATGGAGAAC AAGCACTTGTTTATT CT CGTATG 
CGC TATGATGATC CAGAGGGAGATTATGGGCGTC AAAAAAGACAACGTGAAGTAATT C AAAAAGTC CTTAAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
T CATCAAAAACGATTC C TAATTTGTTAGC TT ATAAAGATTCATTGGAAC ATATTAAATCT TATCAGTTGAAGGK3TGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG 
AAAGAACTGGAT AAAAAG CGTAGTAAAACTCTGAAGAC AAGCGCGATT C TATATGAAGAT TACTATGGTACTACTGCT 
AGTAATGATTCTT CTACTT ATT C ATC AACACAAGAGAATAATTATAATACAAC ACCTTAT TCAGAAGCAC CAC CAAGT 
TAC AGTGGTAATACTACTTATAGTTC TGAGACTAAT CAAAC AACTCAT CAAAATTACTATAATAGTAGCACT C CTGCT 
AGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACG 

CCTAATCCA 

SEQ ID NO. 2505: SAG0368 FROM THE A909 GBS TYPE la STRAIN 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
G AAAC AAAG C C GT T T T C AAT AC T AT TAAT GGGGGTGG AC AC AGGT T C AG AG C ATCGAAAAT C T AAGT GGT C AG GAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTC AATG CTGTTGGTGGTATAAC AGTAAC TAATAAAT TTGACTTT C C AATATCAATTGCTGC CAAT 
GAAC CAGAGTAC AAGG CTGTTGTTGAAC CAGGGAC ACATAAAATAAATGGAGAACAAGCACTTGTTTATTCT CGTATG 
CGC TATGATGAT C C AGAGGGAGAT TATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTC CTTAAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGG;TACTACTGCT 
AGTAATGATTCTT CTACTTAT TCATCAAC AC AAGAGAATAATT AT AAT ACAACAC CTTATTCAGAAGCAC C ACC AAGT 
TAC AGTGGT AAT ACTACTTATAGT TC TGAGACTAATC AAAC AACTC ATC AAAAT TAC TAT AATAGTAGC AC TCCTGCT 
AGTAACTATAGC AGTAAC AC TAAC AC AGGTC AGGCTGATT CAAGTGGAAGTGTCAATAAT C ATAACGGGGCTGCAACG 
CCTAATCCA 

SEQ ID NO. 2506: SAG0368 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT ) 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGC CGTTTTC AATACTATTAATGGGGGTGGAC ACAGGTT CAGAGC ATCGAAAATCTAAGTGGTC AGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGAGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATG 
CGCTATGATGAT C CAGAGGGAGAT TATGGGCGTCAAAAAAGACAACGTGAAGTAATT C AAAAAGTC CTTAAAAAAATA 
TTGG CGTTAAAT AGTATTAGTTC ATAC AAAAAAATTCT TTC CGC AGTAAGTAATAACATG CAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGC TACTTTATC AGATGGTGGC TCTTATC AAATT T TAACTAAGAAACAT C T ACTTGC AGTT CAAAATAGAATTAAG 
AAAGAACTGGAT AAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT AT ATGAAGATTACTATGGTACTACTGCT 
AGTAATGATT CTT C TACTTATTC ATC AACACAAGAGAATAATTAT AAT AC AACAC CT TAT TCAGAAGCAC CAC CAAGT 
TACAGTGGTAATACTACTTATTAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGC 
TAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAAC 
GCCTAATCCA 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

SEQ ID NO. 2507: SAG0368 FROM THE COHl GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GATTTTAAGCTAGATAAATCAAAAAGTC^TGCTATTGAAGAAACAAAGCCGTTO^ 

ACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACT 
AATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGC 
GTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGAT 
ATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGATTTGGTCAATGCTGTTGGTGGTATAACAGTA 
ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACAT 
AAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAA 
AGACAACGTGAAGTAAT TC AAAAAGTC CTTAAAAAAATATTGGCGTTAAATAGTATTAGTT CATACAAAAAAATT CTT 
TCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGAT 
TCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTACTCTATCAGATGGTGGCTCTTATCAAATTTTA 
ACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGATAAAAAGCGTAGTAAAACTCTGAAGACA 
AGCGCGATTCTATATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACAAGAGAAT 
TATTATTATACAACACCCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATACTACTTATAGTTCTGAGACTAATCA 
AAC AACTCAT CAAAGTTACTATAATAGTAGCACT C C TGCTAGTAACTATAGC AGTAACACTAACACAGGTCAGGCTG A 
TTCAAGTGGAAGTGTTAATAATTATAACGGGGCTGC AACGC CTAATC C AAAC AC AGGAAC GC AACCAGTAC C AGGT C A 
AACTAATCCA 

SEQ ID NO. 2508: SAG0368 FROM THE H36b GBS TYPE lb STRAIN 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGC CGTTTTCAATACTATTAATGGGGGTGG ACAC AGGTTCAGAGC ATCGAAAATC TAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGTGGT CC CAAAAAT AATGGACAGACTGGAGTAGAAGCAAAGC TAAATGC AGC CTATGCTTCTGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTT GGTGGTATAACAGTAAC TAATAAATTTGACTTT C CAATAT CAATTGC TGC CAAT 
GAAC C AGAGTAC AAGGC TGTTGTT GAACC AGGGAC ACATAAAATAAATGGAGAAC AAGC ACTTGTTTATTCT CGTATG 
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA 
TTGGCGTTAAATAGTA 

SEQ ID NO. 2509: SAG0368 FROM THE 

TTAGTTCATACAAAAAAATTCTTT CCGCAGTAAGTAATAAC ATGCAAACTAATATTGAGATATC AT CAAAAACGATT C 
C T AATT T GT TAG C T T AT AAAG AT T C AT TGGAACAT AT T AAAT C T TAT C AGT TG AAGGGTG AAG AC G C TACT T TAT C AG 
ATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAACTGGATAAAA 
AGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTA 
CTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATACTA 
CTTATAGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTCCTGCTAGTAACTATAGCAGTA 
ACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCATAACGGGGCTGCAACGCCTAATCCA 

SEQ ID NO. 2510: SAG0368 FROM THE JM9130013 GBS TYPE VIII STRAIN (REVERSE 
COMPLEMENT) 

TATAATTTTTCGACTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCTATTGAA 
GAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAAT 
AGCGATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTG 
ATTAAATTGAGT GGT C C C AAAAATAATGGAC AGACTGGAGTAGAAGCAAAGCTAAATGCAGC CTATGCTTC TGGTGGT 
GCGGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGA 
TTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAAT 
GAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAgCACTTGTTTATTCTCGTATG 
CGCTATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATA 
TTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGCAGTAAGTAATAACATGCAAACTAATATTGAGATA 
TCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAA 
GACGCTACTTTAT CAGATGGTGGC TCTTATCAAATT TTAACTAAGAAAC ATCTACTTGCAGT TC AAAATAGAATTAAG 
AAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACTACTGCT 
AGTAATGATT C TT CTACTTATTC ATC AAC AC AAGAGAATAATTATAAT AC AACAC C TTATT C AGAAGCACC ACC AAGT 
TAC AGTGGTAATACTACT TATAGTTC TGAGACTAATCAAAC AACT CAT C AAAATTAC TAT AAT AGT AGC AC T CC TGC T 
AGT AAC TATAGC AGTAACACTAAC ACAGGTC AGGCTGATTC AAGTGGAAGTGT CAATAAT C ATAACGGGGC TGCAACG 
C CTAATC C A 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 



SEQ ID NO. 2511: SAG0368 FROM THE M781 GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

TTCAATAC T ATTAATGGGTGTGGACACAGGTTCAGAGC ATCGAAAATC TAAGTGGT C AGGAAATAGCGATTC TATGAT 
CTTAGTCACTATAAAT C C TAAAACTAATAAAAC AACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGG 
TCCCAAAAATAATGGACAGACTGGCGTAGAAGCAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATT 
GATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGATTTGGT 
CAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAA 
GGCTGTTGTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCC 
AGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAG 
TATTAGTTCATAC AAAAAAAT TC T TTC CGC AGT AAGTAATAAC ATGC AAAC TAATATTGAGATATCATC AAAAACGAT 
TCCTAATTTGTTAGCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTACTCTATC 
AGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGATAA 
AAAGCGTAGTAAAACTCTGAAGACAAGCGCGATT CTATATGAAGATTAC TATGGTACTAC TGCTAGTAATGATTCTT C 
TACTTATTCATCAACACAAGAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
TACTTATAGTTC TGAGACTAAT C AAAC AAC TCATC AAAGTTACTATAATAGTAGCACTCC TGCTAGTAAC TATAGCAG 
TAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTATAACGGGGCTGCAACGCCTAATCCAAACAC 
AGGAACGC AACCAGTACC AGGT C AAAC TAATC C A 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ250 9 
SEQ2510 
SEQ2511 



ATTTTAAGCTAGATAAATCAAAAAGTCATGCTATTGAAGAAACAAAGCCGTTTTCAATA 



•TTCAATA 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2 50 9 
SEQ2510 
SEQ2511 



TATTAATGGGTGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGC 



TATTAATGGGTGTGGACACAGGTTCAGAGCATCGAAAATCTAAGTGGTCAGGAAATAGC 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 



ATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTA 



ATTCTATGATCTTAGTCACTATAAATCCTAAAACTAATAAAACAACGATGACAAGCTTA 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 



SEQ2501 
SEQ2 502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2 507 
SEQ2 508 
SEQ2509 
SEQ2510 
SEQ2511 



AACGT<3 AC GTATTGATTAAATTGAGTGGTC C C AAAAATAATGG ACAGAC TGGCGTAGAA 



AACGTGACGTATTGATTAAATTGAGTGGTCCCAAAAATAATGGACAGACTGGCGTAGAA 



SEQ2 5 01 
SEQ2502 
SEQ2503 
SEQ2.5 04 
SEQ2505 
SEQ2 506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 



C AAAGCTAAATGC AGC CTATGC TTC TGGTGGTGCGGAAATGGC ATTGATGACTGTT CAA 



CAAAGCTAAATGCAGCCTATGCTTCTGGTGGTGCGGAAATGGCATTGATGACTGTTCAA 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2 505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 



ACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGAT 



ACTTATTAGATATTAATGTTGATTACTTTATGCAAATTAATATGCAAGGATTAGTTGAT 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2 50 6 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2 510 
SEQ2511 



TGGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATT 



TGGTCAATGCTGTTGGTGGTATAACAGTAACTAATAAATTTGACTTTCCAATATCAATT 



SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 



CTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGA 



CTGCCAATGAACCAGAGTACAAGGCTGTTGTTGAACCAGGGACACATAAAATAAATGGA 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

SEQ2501 



SEQ2502 

SEQ2503 

SEQ2504 

SEQ2505 

SEQ2506 

S EQ2 507 AACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGT 

SEQ2508 

SEQ2509 

SEQ2510 

SEQ2 511 AACAAGCACTTGTTTATTCTCGTATGCGCTATGATGATCCAGAGGGAGATTATGGGCGT 

SEQ2501 TATAATTTTTCG 

SEQ2 502 --- TATAATTTTTCG 

SEQ2503 TATAATTTTTCG 

SEQ2 504 • TATAATTTTTCG 

SEQ2 505 TATAATTTTTCG 

SEQ2 506 TATAATTTTTCG 

SEQ2 507 AAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAGT 

SEQ2508 TATAATTTTTCG 

SEQ2509 

SEQ2 510 TATAATTTTTCG 

SEQ2511 AAAAAAGACAACGTGAAGTAATTCAAAAAGTCCTTAAAAAAATATTGGCGTTAAATAGT 

SEQ2 501 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGC TAAATCAAAAAGTCATGCT 

SEQ2 502 CTAATGAATTGTCTAAGAC TTTTAAAGATTTTAAGCTAGC TAAAT C AAAAAGTC ATG C T 

SEQ2 503 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2 504 CTAATGAATTGTCTAAGACTTTT AAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2 505 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2 506 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2 507 TTAGTTCAT - ACAAAAAAATTCTTTCCGCAGTAAGTAA- * TAACATGCAAACTAATATT 

SEQ2508 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2509 TTAGTTCAT-ACAAAAAAATTCTTTCCGCAGTAAGTAA- -TAACATGCAAACTAATATT 

SEQ2 510 CTAATGAATTGTCTAAGACTTTTAAAGATTTTAAGCTAGCTAAATCAAAAAGTCATGCT 

SEQ2 5 11 TTAGTTCAT- ACAAAAAAATTCTTTCCGCAGTAAGTAA- - TAACATGCAAACTAATATT 

SEQ2 501 TTGAAGAAACAAAGC CGTTT T C AATACTATTAATGGGGGTGGACAC AGGTTC AGAGCAT 

SEQ2502 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT 

SEQ2 503 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT 

SEQ2 504 TTGAAGAAACAAAGC CGTTTT CAATACTATTAATGGGGGTGGAC AC AGGTTC AGAGCAT 

SEQ2 505 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT 

SEQ2 506 TTGAAGAAACAAAGCC GTTTTC AAT AC TATTAATGGGGGTGGAC ACAGGTTCAGAG CAT 

SEQ2507 AGATATC ATC AAAAACGATTC CTAATTTGTTAGCTTATAAAGAT TC A TTGGAACAT 

SEQ2 508 TTGAAGAAACAAAGC CGTTT TC AATACTATTAATGGGGGTGGACAC AGGTTC AGAGCAT 

SEQ2 509 AGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCA TTGGAACAT 

SEQ2 510 TTGAAGAAACAAAGCCGTTTTCAATACTATTAATGGGGGTGGACACAGGTTCAGAGCAT 

SEQ2 511 AGATATCATCAAAAACGATTCCTAATTTGTTAGCTTATAAAGATTCA TTGGAACAT 

SEQ2 501 GAAAATCTAAGT - GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA 

SEQ25 0 2 GAAAATCT AAGTTGGTC AGGAAAT AGCGAT TC T ATGAT CTTAGT C AC T ATAAATC C TAA 

SEQ2 503 GAAAATCTAAGT - GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA 

SEQ2504 GAAAATCTAAGT-GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA 

SEQ2 505 GAAAATCTAAGT - GGT CAGGAAATAGC GAT T CTATGAT CT TAGTC ACTATAAATCCTAA 

SEQ2 506 GAAAATCTAAGT- GGTCAGGAAATAGCGATTCTATGATCTTAGTCACTATAAATCCTAA 

SEQ2507 TTAAATCTTATC - AGTTGAAGGGTGAAGACGCTACTCTATCAG - - ATGGTGGCTCTTAT 

SEQ2 508 GAAAATCTAAGT - GGT CAGGAAATAGC GATTCTATGATCT TAGTC ACTATAAATCCTAA 

SEQ2 5 0 9 TTAAATCTTATC - AGTTGAAGGGTGAAGACGCTACTTTATCAG- - ATGGTGGCTCTTAT 

SEQ2510 GAAAATCTAAGT- GGT CAGGAAATAGCGATTCTATGATCT TAGTC ACTATAAATCCTAA 

SEQ2511 TTAAATCTTATC- AGTTGAAGGGTGAAGACGCTACTCTATCAG- -ATGGTGGCTCTTAT 

SEQ2501 ACT AATAAAACAACGATGAC AAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC 
ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC 
ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGAT TAAATTGAGTGGT C C 
ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC 
ACTAATAAAACAACGATGACAAGCTTAGAACGTGACGTATTGATTAAATTGAGTGGTCC 
AAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAATAGAATTAAGAAAGAGCTGGAT 
AC T AATAAAAC AACGATGACAAGCTT AGAACGTGACGT ATTGATTAAATTGAGTGGT CC 
AAAT T TTAACTAAGAAAC AT CTACTTGC AGTTC AAAATAGAATT AAGAAAGAACTGGAT 
ACTAATAAAAC AACGATGAC AAGCTTAGAACGTGACGTATTGAT TAAATTGAGTGGTC C 
AAATTTTAAC TAAGAAAC ATCTACTTGCAGTT CAAAATAGAATTAAGAAAGAGC TGGAT 

AAAAAT AATGGACAGACTGGAGTAGAAGC AAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAATAATGGAC AGACTGGCGTAGAAGC AAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAAT AATGGACAGACTGGAGTAGAAGC AAAG- - CTAAATGCAGCCTATGCTTCTGGT 
AAAAAT AATG G AC AG AC TGG AGT AGAAG C AAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAATAATGGAC AG ACTGGAGT AG AAGC AAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAATAATGGAC AGACTGGAGTAGAAGC AAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACT 
AAAAAT AATGGACAGACTGGAGTAGAAGCAAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCTATATGAAGATTACTATGGTACT 
AAAAATAATGGAC AGACTGGAGTAGAAGCAAAG - - CTAAATGCAGCCTATGCTTCTGGT 
AAAAG CGT AGT AAAACT C TGAAG ACAAG C G C GAT T C TAT AT G AAGAT T AC T AT GGT AC T 

GTGC - GGAAATGGCATTGATGACTGTTCAAGAC TTATTAGATATTAATGTTG ATTAC TT 
GTGC - GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT 
GTGC - GGAAATGGC ATTGATGAC TGTT C AAGACT TATTAGATATTAATGTTGATTACTT 
GTGC - GGAAATGGCATTGATGAC TGTT CAAGAC TTATTAGAT ATT AATGTTGATTAC TT 
GTGC - GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT 
GTGC - GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT 
CTGCTAGTAATGATTCTTCTACTTATTCATCAAC - ACAAGAGAATTATTATTAT - ACAA 
GTGC - GGAAATGGC ATTGATGAC TGTTCAAGACTTATTAGAT AT TAATGTTGATTAC TT 
CTGCTAGTAATGATTCTTCTACTTATTCATCAAC - ACAAGAGAATAATTATAAT - ACAA 
GTGC - GGAAATGGCATTGATGACTGTTCAAGACTTATTAGATATTAATGTTGATTACTT 
CTGCTAGTAATGATTCTTCTACTTATTCATCAAC - ACAAGAGAATAATTATAAT - ACAA 

ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 
ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 
ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 
SEQ2504 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 
SEQ2505 ATGC AAATTAATATGCAAGGATTAGTTGATT T AGT C AATGC TGTTGGTGGT ATAAC AGT 

SEQ2 506 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 

SEQ2507 ACCCTTATTCAGAAGCACCACCAAGTTACAGTGGT - AATACTACTTATAGTT CTGA 

SEQ2 508 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 

SEQ2 509 ACC - TTATTCAGAAGCACCACCAAGTTACAGTGGT- AATACTACTTATAGTT CTGA 

SEQ2510 ATGCAAATTAATATGCAAGGATTAGTTGATTTAGTCAATGCTGTTGGTGGTATAACAGT 
SEQ2 511 ACC - TTAT TC AGAAGC AC C AC C AAGTTAC AGTGGT - AATACTACTTATAGTT CTGA 

SEQ2 501 AC TAATAAATTTGACTTTC C AATAT C AATTGCTGC C AATGAACC AGAGTACAAGGC TGT 

SEQ2502 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT 

SEQ2503 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT 

SEQ2 504 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACCAGAGTACAAGGCTGT 

SEQ2 505 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACC AGAGTACAAGGC TGT 

SEQ2 506 AC TAAT AAATTTGACTTT CC AATAT C AAT TG C TGC C AATGAAC C AGAGTAC AAGG C TGT 

SEQ2 507 ACTAATCAAAC - AACTCATCAA AGTTAC TAT - AATAG - - TAGCACTCCTGCTAGT 

SEQ2 508- AC TAATAAATTTGACTTTC C AAT ATCAATTGC TGC C AATGAAC C AG AGTACAAGGC TGT 

SEQ2509 - ACTAATCAAAC -AACTCATCAA AATTACTAT -AATAG- -TAGCACTCCTGCTAGT 

SEQ2 510 ACTAATAAATTTGACTTTCCAATATCAATTGCTGCCAATGAACC AGAGTACAAGGCTGT 

SEQ2511 ACTAATCAAAC -AACTCATCAA AGTTAC TAT -AATAG- -TAGCACTCCTGCTAGT 



SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 

SEQ2 501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2 505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 

SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510 
SEQ2511 

SEQ2501 
SEQ2502 
SEQ2503 
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SEQ25 01 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ25 0 6 
SEQ2507 
SEQ2508 
SEQ2509 
SEQ2510. 
SEQ2511 

SEQ2501 
SEQ2502 
SEQ2503 
SEQ2504 
SEQ2505 
SEQ2506 
SEQ2507 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
ACTATAGCAGTAACAC - TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTA 
GTTGAACCAGGGACACATAAAATAAATGGAGAACAAGCACTTGTTTATTCTCGTATGCG 
ACTATAGCAGTAACAC - TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCAATAATCA 
GTTGAAC C AGGGAC AC ATAAAATAAATGGAGAACAAGCAC TTGT TTATTC TCGTATGCG 
ACTATAGCAGTAACAC - TAACACAGGTCAGGCTGATTCAAGTGGAAGTGTTAATAATTA 

TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
AACGGGG C TG C AACG C C T AAT C C AAAC AC AGG AACGC AAC C AGT AC C AGGT C AAAC T AA 
TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 

AACGGGGCTGCAACGCCTAATCCA 

TATGATGATCCAGAGGGAGATTATGGGCGTCAAAAAAGACAACGTGAAGTAATTCAAAA 
AACGGGGCTGC AACGCCTAATCCAAACAC AGGAACGCAAC C AGTACC AGGTCAAAC TAA 

GT CCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATAC AAAAAAATTCTTTC CGC 
GT CC TTAAAAAAATATTGGCGTTAAATAGTAT TAGTTC ATACAAAAAAAT TCTTT C CGC 
GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC 
GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC 
GTCC TTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTC CGC 
GTCCTTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTTCCGC 

CCA 

GTCCTTAAAAAAATATTGGCGTTAAATAGTA 



GT CC TTAAAAAAATATTGGCGTTAAATAGTATTAGTTCATACAAAAAAATTCTTT C CGC 
CCA 

GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 
GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 
GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 
GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 
GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 
GTAAGTAATAACATGCAAACTAATATTGAGATATCATCAAAAACGATTCCTAATTTGTT 



GTAAGTAATAACATGC AAACTAATATTGAGAT ATCATCAAAAACGATT C C TAATTTGT T 



GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC 
GC TTATAAAGATTCATTGGAAC ATATTAAATC TTATC AGTTGAAAGGTGAAGACG C TAC 
GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC 
GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC 
GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC 
G C T T AT AAAG AT T C AT T GGAAC AT AT T AAATC T TAT C AGT T G AAGG GT G AAG AC G C TAC 



GCTTATAAAGATTCATTGGAACATATTAAATCTTATCAGTTGAAGGGTGAAGACGCTAC 
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Table 25: Comparative Sequences relating to SAG0368 

(protein of unknown function) 

TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 
TTATCAGATGGTGGCTCTTATCAAATTTTAACTAAGAAACATCTACTTGCAGTTCAAAA 



TT AT C AGATGGTGGC T CTTAT CAAATTTTAAC TAAGAAAC ATCT AC TTGCAGTTC AAAA 



AGAATTAAGAAAGAACTGGATAAAAAG CGTAGTAAAAC TCTGAAGACAAGCGCGATT C T 
AGAATTAAGAAAGAACTAGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAACTGGATAAAAAGCGTAGTAAAACTCTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAAC TGGATAAAAAGCGTAGTAAAACT CTGAAGACAAGCGCGATTCT 
AGAATTAAGAAAGAAC TGGATAAAAAGCGTAGTAAAAC TCTGAAGACAAGCGCGATTC T 



AGAATTAAGAAAGAAC TGGAT AAAAAGCGTAGTAAAAC TCTGAAGAC AAGCGCGATTCT 



TATGAAGATTACTATGGTACTACTGC TAGTAATGATTCTT CTACTTATTCATC AAC AC A 
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 
TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 
TATGAAGATTACTATGGTAC T AC TGCTAGTAATGATTC TTCTACTTATTCATC AAC AC A 
TATGAAGATTACTATGGTAC TACTGCTAGTAATGATTCTTCTAC TTATTCATCAAC ACA 
TATGAAGATTACTATGGTACTACTGCTAGTAATG ATT CTT CTAC TTATT CATC AAC ACA 



TATGAAGATTACTATGGTACTACTGCTAGTAATGATTCTTCTACTTATTCATCAACACA 



GAGAAT AATTATAATAC AAC AC CTTATTC AGAAGC AC C ACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAAC AC CTTATT CAGAAGCAC C AC C AAGTTAC AGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 
GAGAATAATTATAATACAACACCTTATTCAGAAGCACCACCAAGTTACAGTGGTAATAC 



GAGAATAATTATAATACAAC AC CTTATTCAGAAGCAC C AC C AAGTTACAGTGGTAATAC 



ACTTAT - AGT T CTGAGACT AAT CAAAC AAC T CAT C AAAATTACTATAATAGTAGC ACT C 
ACTTAT - AGTTCTGAGACTAATCAAACAACTCATCAAAGTTACTATAATAGTAGCACTC 
ACTTAT - AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC 
ACTTAT - AGT T CTGAGACTAATC AAACAAC T C ATC AAAAT TAC TATAATAGT AGC ACT C 
ACTTAT- AGTTCTGAGACTAATCAAACAACTCATCAAAATTACTATAATAGTAGCACTC 
ACTTATTAGTT CTGAGACTAATC AAACAACT CATC AAAATTACTATAATAGTAGC ACT C 



ACTTAT - AGTTC TGAGAC T AAT CAAAC AACT CATCAAAATTAC TAT AAT AGT AGC ACTC 
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TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
TGCTAATAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 



TGCTAGTAACTATAGCAGTAACACTAACACAGGTCAGGCTGATTCAAGTGGAAGTGTCA 
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TAATCATAACGGGGCTGCAACGCCTAATCCA 
TAATCATAATGGGGCTGCAACGCCTAATCCA 
TAATCATAACGGGGCTGCAACGCCTAATCCA 
TAATCATAACGGGGCTGCAACGCCTAATCCA 
TAATCATAACGGGGCTGC AACGC C TAAT CCA 
TAATCATAACGGGGC TGC AACGC CTAATCCA 



TAATCATAACGGGGCTGCAACGCCTAATCCA 



>SEQ ID NO 2550: 54_090 frame: 1 

l^FSTNELSKTFKDFKIiAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTnsTKTTMTSLERDVLIKLSGPKIOTGQTGVEAKLNAA 

DYFMQI NMQGLVDLVNAVGGI TVTNKFDF P I S I AANEP E YKAWEPGTHKINGEQALVYS 
RMR YDDPEGD YGRQKRQREVI QKVLKKILALNS I S S YKKI LS AVSNNMQTNI E I S S KTI P 
NLIiAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 
AILYEDY^GTTASNDSSTYSSTQE^YNTTPYSEAPPSYSGNTTYSSETN^QTTHQitrrYNS 
S TPASNYS SNTNTGQADS S GSVNNHNGAATPNP 

>SEQ ID NO 2551s 54_1169NT frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKLVRK . RFYDLSH 
YKS .N. . NNDDKLRT , RID . IEWSQK . WTDWRRS KAKCS L CFWWCGNG IDDCSRLI RY . C 
. LLYAN . YARIS . FSQCCWWYNSN . . I . LSNINCCQ . TRVQGCC . TRDT . NKWRTS TCLF 
SYAL . . SRGRLWASKKTT . SNSKSP . KNIGVK . Y - FIQKNSFRSK . .HAN. Y.DIIKNDS 
. FVSL . RFIGTY . ILSVER . RRYFIRWWLLSNFN . ETSTCSSK . N . ERTR . KA . . NSEDK 
RDSI . RLLWYYC . . . FFYLFINTRE . L . YNTLFRSTTKLQW. YYL . F . D . SNNSSKLL . . 
.HSC. .L.Q.H.HRSG.FKWKCQ.S.WGCNA.S 

>SEQ ID NO 2552.*54_I8RS21 frame: 1 

YNFSTNELSKTFKDFIO^AKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLERDVDIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV 
D YFMQINMQGLVDLVNAVGGI TVTNKFDFP I S IAANEPE YKAWEPGTHKI NGEQALVYS 
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNS I SSYKKI LSAVSNNMQTNIEI SSKTIP' 
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTIiKTS 
AILYEDYYGTTAS1TOSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS 
STPASNYS SNTNTGQADS S GSVNNHNGAATPNP 



>SEQ ID NO 2553 :54_2 603 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLjERDVLIKBSGPKNNGQTGV^ 

DYFMQINMQGLVDLVNAVGGITVTNKFDFPI S IAANEPE YKAWEPGTHKI NGEQALVYS 
RMRYDDPEGDYGRQKRQREVIQKVLKKIIiALNSISSYKKILSAVSNNMQTNIEI SSKTIP 
NLLA YKD S L EH I KS YQ L KGED ATL SDGGS YQ I LTKKHL LA VQNR I KKE LDKKR S KTLKTS 

AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS 
STPASNYS SNTNTGQADS S GSVNNHNGAATPNP 

>SEQ ID NO 2554: 54_A909 frame: 1 

YNFSTNELSKTFKDFKXAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INP KTNKTTMTSLERDVL I KLSGPK87NGQTGVEAKLNAA YASGGAEMALMTVQDLLDINV 
DYFMQ I NMQGLVDLVNAVGGI TVTNKFDF P I S IAANEPE YKAWEPGTHKI NGEQALVYS 
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNSI SS YKKI LSAVSNNMQTNIEI SSKTI P 
NLLA YKDS LEHI KS YQLKGEDATLSDGGS YQI LTKKHLLA VQNR I KKELDKKRS KTLKTS 
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AI L YED YYGTTASNDS ST YS S TQENNYNTTP YSEAP P S YS GNTT YS S ETNQTTHQNY YNS 
STPASNYSSNTNTGQADSSGSVNNHNGAATPNP 



>SEQ ID NO 2555:54_CJB110 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSiavSGNSDSMILVT 
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV 
DYFMQINMQGLVDLVNAVGGI TVTNKFDFP I S IAANEPE YKAWEPGTHKINGEQALVYS 
RMRYDDPEGDYGRQKRQREVI QKVLKKILALNS I S S YKKILSAVSNNMQTNIEI SS KTI P 
^I^YKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTY . F.D. SNNSSKLIi . . 

>SEQ ID NO 2556:54_COHl frame: 1 

DFKLDKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVTINPKTNKTTMTSL 
ERDVL I KLS GPKNNGQTGVEAKLNAAYAS GGAEMALMTVQDLLD I NVD YFMQI NMQGLVD 
LVNAVGGI TVTNKFDFP I S I AANE PE YKAWE PGTHKI NGEQALVYS RMRYDDPEGD YGR 
QKRQREVI QKVLKKI LALNS I S S YKKI LS AVSNNMQTNI E I S SKTI PNLLA YKDSLEHI K 
SYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTSAILYEDYYGTTAS 
NDS ST YSS TQENYY YTTPLFRS TTKLQW . YYL .F.D. SNNSSKLL . . . HSC . .L.Q.H.H 
RSG . FKWKC . RGCNA . SKHRNATSTRSN . S 

>SEQ ID NO 2557:54_H36B frame: X 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYA^ 

D YFMQ I NMQGLVDLVNAVGGI TVTNKFDF P I S IAANEPE YKAWEPGTHKINGEQALVYS 
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNS I S S YKKILSAVSNNMQTNIEI SS KTI P 
NLLAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 
AI L YED YYGTTASNDS S T YS STQENNYNTTP YSEAPP S YS GNTT YS S ETNQTTHQNY YNS 
STPASNYS SNTNTGQADSSGSVNNHNGAATPNP 

>SEQ ID NO 2558:54_JM9130013 frame: 1 

YNFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 
INPKTNKTTMTSLEPJDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLD 
DYFMQINMQGLVDLVNAVGGI TVTNKFDFP I S IAANEPEYKAWEPGTHKINGEQALVYS 
RMRYDDPEGDYGRQKRQREVIQKVLKKILALNS I SS YKKILSAVSNNMQTNIEI SSKTIP 
NLLAYKDS LEHI KS YQLKGEDATLSDGGS YQI LTKKHLLAVQNRI KKELDKKRS KTLKT S 
AILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS 
STPASNYS SNTNTGQADS SGSVNNHNGAATPNP 

>SEQ ID NO 2559:54_M781 frame: 2 

S ILLMGVDTGSEHRKSKWSGNSDSMILVTINPKTNKTTMTSLERDVLI KLSGPKNNGQTG 
VEAKLNAAYAS GGAEMALMTVQDLLD INVDYFMQINMQGLVDLVNAVGGI TVTNKFDF P I 
SIAANEPEYKAVVEPGTHKINGEQALVYSRMRYDDPEGDYGRQKRQREVIQKV^ 
NSISSYKKILSAVSNNMQTNIEISSKTIPNLLAYKDSLEHIKS YQLKGEDATLSDGGS YQ 
ILTKIQSLLAVQNR I KKELDKKRS KTLKTS AIL YED YYGTTASNDS ST YSS TQENNYNTTP 
YSEAPP SYS GNTT YSSETNQTTHQSYYNS STPASNYS SNTNTGQADS SGSVNNYNGAATP 
NPNTGTQPVPGQTNP 



SEQ2550 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 

SEQ2551 NFS TNELS KTFKDF KLAKS KSHAI EETKPFS I LLMGVDTGS EHRKS KLVRKRF YDL SHY 

SEQ2552 NFSTNELSKTFKDFKIiAKSKSHAI EETKPFS I LLMGVDTGSEHRKSKWSGNSDSMILVT 

SEQ2553 NFSTNELSKTFKDFKLAKSKSHAIEETKPFSILLMGVDTGSEHRKSKWSGNSDSMILVT 

SEQ2 554 NFSTNELS KTFKDF KLAKS KSHAI EETKPFS I LLMGVDTGS EHRKS KWSGNSDSMILVT 

SEQ2 555 NFSTNELSKTFKDFKIiAKS KSHAI EETKPFS ILLMGVDTGSEHRKS KWSGNSDSMILVT 

SEQ2556 DFKLDKS KSHAI EETKPFS ILLMGVDTGSEHRKS KWSGNSDSMILVT 

SEQ2557 NFSTNELSKTFKDFKLAKS KSHAI EETKPFS ILLMGVDTGSEHRKS KWSGNSDSMILVT 

SEQ2558 NFSTNELSKTFKDFKLAKS KSHAIEETKPFS ILLMGVDTGSEHRKS KWSGNSDSMILVT 

SEQ2559 SI LLMGVDTGS EHRKS KWSGNSDSMILVT 

SEQ2 55 0 NPKTNKTTMTS LERDVL I KIjS GPKNNGQTGVEAKLNAAYAS GGAEMALMTVQDLLD INV 

SEQ2 551 SNNNDDKLRTRIDI EWSQKWTDWRRS KAKCSLCFWWCGNGIDDCSRLIRYCLLY 

SEQ2552 NPKTNKTTMTS LERDVLI KLS GPKNNGQTGVEAKLNAAYAS GGAEMALMTVQDLLD INV 

SEQ2553 NPKTNKTTMTS LERDVL I KLS GPKNNGQTGVEAKLNAAYAS GGAEMALMTVQDLLD INV 

SEQ2 554 NPKTNKTTMTS LERDVLI KLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLD INV 

SEQ2555 NPKTNKTTMTSLERDVLIKLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV 

SEQ2 556 NPKTNKTTMTSLERDVL I KLS GPKNNGQTGVEAKLNAAYAS GGAEMALMTVQDLLD INV 

SEQ2 557 NPKTNKTTMTSLERDVLI KLSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV 
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SEQ2558 NPKTmCTTMTSIiERDVIiIKIiSGPKNNGQTGVEAKLNAAYASGGAEMALMTVQDLLDINV 

SEQ2 559 NPKTNKTTMTS LERDVLI KLS GPKNNGQTGVEAKLNAA YASOT 

SEQ2550 YPMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS 

SEQ2551 NYARISFSQCCWWYNS -NILSNINCCQTRVQGCCTRDTNKWRTSTCLFSY 

SEQ2 552 YFMQINMQGLVDLWAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS 

SEQ255 3 YFMQINMQGLVDLVNAVGGITVTNKFDFPISIAANEPEYKAVVEPGTHKINGEQALVYS 

SEQ2 554 YFMQINMQGLVDLVNAVGGITVTNKFDFP IS I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2555 YFMQINMQGLVDLVNAVGGITVTNKFDFP I S I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2 5 5 6 YFMQINMQGLVDLVNAVGGITVTNKFDFP IS I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2 557 YFMQINMQGLVDLVNAVGGITVTNKFDFP IS I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2 558 YFMQINMQGLVDLVNAVGGI TVTNKFDFPIS I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2 559 YFMQINMQGLVDLVNAVGGI TVTNKFDFPIS I AANEPE YKAWEPGTHKINGEQALVYS 

SEQ2550 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I SS YKKI LSAVSNNMQTNI EI S SKTI P 

SEQ2551 LSRGRLWASKKTTSNSKSPKNIGVKYFIQKNSFRSKHANYDIIKNDSFVSLRFIGTYI- 

SEQ2552 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I SS YKKI LSAVSNNMQTNI EI SSKTI P 

SEQ2553 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I SS YKKI LSAVSNNMQTNI EISSKTIP 

SEQ2554 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I SS YKKI LSAVSNNMQTNI EISSKTIP 

SEQ2555 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I S S YKKI LSAVSNNMQTNI E I SS KT I P 

SEQ2 556 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I S S YKKI LSAVSNNMQTNI EI S SKTI P 

SEQ2557 MRYDDPEGDYGRQKRQREVI QKVLKKI LALNS I S S YKKI LSAVSNNMQTNI EI SSKTI P 

SEQ2558 MRYDDPEGDYGRQKRQREVIQKVLKKI LALNS I S S YKKILSAVSNNMQTNIEI S SKTI P 

SEQ2559 MRYDDPEGDYGRQKRQREVI QKVLKKILALNS I SS YKKILSAVSNNMQTNI EISSKTI P 

SEQ2 55 0 LLAYKDSLEHIKSYQLKGEDATLSDGGS YQILTKKHLLAVQNRIKKELDKKRSKTLKTS 

SEQ2551 L-SVERRRYFIRWWLLSNFNETSTCSSKNERTRKANSEDKRDSIRLLWYYCFFYLFINT 

SEQ2 552 LLAYKDSLEHI KS YQLKGEDATLSDGGS YQILTKKHLLAVQNR I KKELDKKRSKTLKTS 

SEQ2553 LLAYKDSLEHI KSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 

SEQ2554 LLAYKDSLEHI KSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 

SEQ2555 LLAYKDS LEHI KS YQLKGEDATLSDGGS YQ I LTKKHLLAVQNR I KKELDKKRSKTLKTS 

SEQ2556 LIAYKDSLEHIKSYQLKGEDATLSDGGSYQILTKKHLLAVQNRIKKELDKKRSKTLKTS 

SEQ2557 LLAYKDS LEHI KSYQLKGEDATLSDGGSYQ I LTKKHLIiAVQNRI KKELDKKRSKTLKTS 

SEQ2 558 LLAYKDSLEHIKS YQLKGEDATLSDGGS YQILTKKHLLAVQNR I KKELDKKRSKTLKTS 

SEQ255 9 LLAYKDSLEHIKS YQLKGEDATLSDGGS YQILTKKHLLAVQNRIKKELDKKRSKTLKTS 

SEQ2550 IL YED Y YGTTASNDSS TYS STQENNYNTTP YS EAP P S YSGNTT YS S ETNQTTHQNY YNS 

SEQ2551 ELY NTLFRST TKLQWYYLFDSNNSSKLLHSCLQH 

SEQ2552 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS 

SEQ2553 I L YED YYGTTASNDS S TYS STQENNYNTTP YS EAP P S YS GNTT YS S ETNQTTHQNY YNS 

SEQ2554 IL YED Y YGTTASNDSS TYS S TQENNYNTTP YS EAP PS YS GNTT YS S ETNQT THQNY YNS 

SEQ2555 ILYED YYGTTASNDS S TYS S TQENNYNTTP YS EAPP S YS GNTT YFD SNNS S KLL 

SEQ255 6 ILYED Y YGTTASNDSS TYSSTQENYYYTTPLFRSTTKLQWYYLFDSNNSSKLLHSCLQH 

SEQ2557 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYS EAPPS YSGNTT YSS ETNQTTHQNY YNS 

SEQ2558 ILYED YYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQNYYNS 

SEQ2559 ILYEDYYGTTASNDSSTYSSTQENNYNTTPYSEAPPSYSGNTTYSSETNQTTHQSYYNS 

SEQ255 0 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP 

SEQ2 55 1 RSGFKWKCQSWGCNAS 

SEQ2552 TPASNYSSNTNTGQADSSGSVNNHNGAATPNP- 

SEQ2 553 TPASNYSSNTNTGQADSSGS VNNHNGAATPNP 

SEQ2554 TPASNYSSNTNTGQADSSGS VNNHNGAATPNP 

SEQ2555 

SEQ2 556 RSGFKWKCLRGCNASKHRNATSTRSNS 

SEQ2 55 7 TPASNYSSNTNTGQADSSGS VNNHNGAATPNP 

SEQ255 8 TPASNYSSNTNTGQADSSGS VNNHNGAATPNP 

SEQ2 559 TPASNYSSNTNTGQADS SGSVNNYNGAATPNPNTGTQPVPGQTNP 



615 



WO 2004/018646 



PCT/US2003/026827 



Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 

SEQ ID NO. 2601: SAG0503 FROM THE 090 GBS TYPE la STRAIN 
(REVERSE COMPLEMENT) 

GGGCACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAA 
AGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATAC 
AACCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAA 
TTATGGTGTGT C TGGGAATACTAGTCAACAAATTTT AAAACGTATGACGA£AGATCCTC AAAT CGAAAAAGATTTAGA 
GAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATC 
ACTAAATTC C TTTGAGAAAC C AGC AGAAGCATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGC AAGACAAGATAA 
TCCTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAAC 
CGT TATTGATAAT TGGAATAAAGC TACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGA 
CCGC CTTTATAAGGGAATAAATGGTAAAGAGGGTAT TACAGAGT CATC AAAT AGTCAGGC AAGTATCACTAATGATGC 
TC T CTTT ACTGG AGAC CAT TTTCATC CC AATAATATTGGC TATCAAATCATGTC TAACGC CGT TATGGAGAAAATAAA 
TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAG 

SEQ ID NO. 2602: SAG0503 FROM THE H36b GBS TYPE lb STRAIN 
(REVERSE COMPLEMENT) 

TTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCCT 
AACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCA 
AGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTGT 
GTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTGA 
TTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTC 
C TTTGAGAAAC C AGC AGAAGCATATAAGGAACGTTTGAAAGAAATCC TTGCAAAAGCAAGACAAGATAATCCTAAATT 
GCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTGA 
TAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTTA 
TAAGGGAATAAATGGTAAAGAGGGTATTATAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTAC 
TGGAGAC C ATTT TCATC CCAATAATATTGGCTATC AAATCATGTCTAACGC CGTTATGGAGAAAATAAATGAAACAAG 
AAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGGTCC 

SEQ ID NO. 2603: SAG0503 FROM THE 18RS21 GBS TYPE II STRAIN (REVERSE 
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAAC AAAGAAAGTTAT CC C ACTTAACTATGTTGCTC TTGGAGATTCTC TGAC CGAAGGTGTGGGCGATACAACCTCTC 
AAGGTGGTTTTGTTCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
C CTT TGAGAAAC C AGCAGAAGCATATAAGGAACGTT TGAAAGAAATCC TTGC AAAAGCAAGAC AAGATAATC CTAAAT 
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGC TACAAAAGAAGTAGTTGATG C TTCAGAAAATGTTTATTTTGTC C C AATTAATGACCGC CTTT 
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTA 
C TGGAGACCATTTTC ATC C CAAT AATATTGGCTATC AAATCATGT CTAACGCCGTTATGGAGAAAATAAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ ID NO. 2604: SAG0503 FROM THE COHl GBS TYPE III STRAIN (REVERSE 
COMPLEMENT) 

GGACAAGTTTGTACAAAAAAGCAGGC TCTATTTT TT C CTTGATCATTC C AAAATC AAATCCTAAATTAACAAAAAAAG 
ACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAA 
CCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATT 
ATGGTGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGA 
AAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCAC 
TAAATTCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATC 
CTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCG 
TTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACC 
GCCTTTATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTC 

TCTTTACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATG 
AAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 
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Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 

SEQ ID NO. 2 605: SAG0503 FROM THE CJB110 GBS NONTYPEABLE STRAIN (REVERSE 
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTC 
AAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
CCTTTGAGAAAC CAGCAGAAGC ATATAAGGAACGTT TGAAAGAAATAC T TGC AAAAG CAAGACAAGATAATCCTAAAT 
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGT TTATTTTGTCC C AATTAATGACCGC CTTT 
ATAAGGGAAT AAATGGTAAAGAGGGTATTACAGAGT CAT CAAATAGTC AGGC AAGTATCACTAATGATGCTC TC TTTA 
C TGGAGAC C ATT TTCATC C C AATAATATTGGCTATC AAATC ATGT C TAACGC CGTTATGGAGAAAAT AAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ ID NO. 2606: SAG0503 FROM THE 1169NT1 GBS TYPE V STRAIN (REVERSE 
COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAACAAAGAAAGTTATCCCAC TTAACTATGTTGC TC TTGGAGATT C TC TGAC CGAAGGTGTGGGGGATAC AAC CTCTC 
AAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
C CTTTGAGAAACCAGC AGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGC AAAAGCAAGACAAGATAATC CTAAAT 
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACCGCCTTT 
ATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGT CATCAAATAGTCAGGCAAGTATCAC TAATGATGC TC TC TTTA 
CTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ ID NO. 2607: SAG0503 FROM THE? JM9130013 GBS TYPE VIII STRAIN 
(REVERSE COMPLEMENT) 

GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTCC 
TAAC AAAGAAAGTTAT CC CACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAAC CTCTC 
AAGGTGGTTTTGTTCC ACTGCTAT C AGAATCACT CC ATAATCGATACT C TTACCAAGTGACTTCTGTTAATTATGGTG 
TGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCTG 
ATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATT 
C CTTTGAGAAAC CAGCAGAAGC AT AT AAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATC CTAAAT 
TGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATTG 
ATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGC TTC AGAAAATGTTTATTTTGT CC C AATTAATGAC CGC CTTT 
AT AAGGGAAT AAAT GGT AAAG AGGGT AT TAC AG AGT CAT C AAAT AGT C AGG C AAGT AT C A C T AATG ATG C T C T C T T TA 
CTGGAGAC CATTTT CATCCC AATAATATTGGCTATC AAATCATGTC TAACGC CGTTATGGAGAAAATAAATGAAACAA 
GAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ ID NO. 2608: SAG0503 FROM THE 2603 V/R GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAGACTTC 
C TAACAAAGAAAGTTATC C CACTTAACTATGTTGCTCT TGGAGATTC T CTGAC CGAAGGTGTGGGCGATACAAC CTCT 
CAAGGTGGTTTTGTTC C ACTGCTATCAGAAT CACTC CATAATCGATAC T C TTAC CAAGTGACTTCTGTTAATTATGGT 
GTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGAAAGCT 
GATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAAT 
TCCTTTGAGAAAC CAG C AGAAGC ATATAAGGAACGTTTGAAAGAAATC CTTGCAAAAGCAAGACAAGATAAT CC TAAA 
TTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCGTTATT 
GATAATTGGAATAAAG CTAC AAAAGAAGTAGTTGATGCTT C AGAAAATGTTTATTTTGTC C CAATTAATGAC CGCCTT 
TATAAGGGAATAAATGGTAAAGAGGGTATTAC AGAGTCATCAAATAGT CAGGCAAGTATC ACTAATGATGC TC TC TTT 

AC TGGAGAC CAT TTTCATCC CAATAATATTGGCTATCAAATCATGTCTAACGC CGTTATGGAGAAAAT AAATGAAAC A 
AGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGG 



617 



WO 2004/018646 



PCT/US2003/026827 



Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 

SEQ ID NO. 2609: SAG0503 FROM THE M781 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAATCCTAAATTAACAAAAAAAG 
ACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGCTCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAA 
CCTCTCAAGGTGGTTTTGTCCCACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATT 
ATGGTGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGAAAAAGATTTAGAGA 
AAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGCTGTTATTCGTAAAGAGCTCAGTCATTTATCAC 
TAAATTCCTTTGAGAAACCAGCAGAAGCATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATC 
CTAAATTGCCTATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAATGCAAACCG 
TTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAATGTTTATTTTGTCCCAATTAATGACC 
GCCTTTATAAGGGAATAAATGGTAAAGAGGGTATTACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTC 
TCTTTACTGGAGACCATTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAATG 
AAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 



SEQ2 601 GGC^CAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ2602 TTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ2603 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ2 604 GGACAAGTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTT^ 

SEQ2 605 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ 2 606 GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATGATTCCAAAATCAAA 

SEQ2 607 GTTTGTACAAAAAAGC^GGCTCTATTTTTTCCTTGATC^TTCCaAAATCAAA 

SEQ2 608 AGTTTGTACAAAAAAGC^GGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ2 609 GGAC^GTTTGTACAAAAAAGCAGGCTCTATTTTTTCCTTGATCATTCCAAAATCAAA 

SEQ2 601 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAG^ 

SEQ2 602 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2 603 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2 604 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2 605 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2 606 TC CTAAAT TAACAAAAAAAGAC T T CC TAACAAAGAAAGT TAT C C CAC T TAAC TATGT T G C 

SEQ2607 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2608 TCCTAAATTAACAAAAAAAGACTTCCTAACAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ 2 609 TCCTAAATTAACAAAAAAAGACTTCCTAAGAAAGAAAGTTATCCCACTTAACTATGTTGC 

SEQ2 601 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTCCC 

SEQ2 602 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC 

SEQ2 603 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC 

SEQ2 6 04 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC 

SEQ2 605 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTCCC 

SEQ2 606 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC 

SEQ2 607 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC 

SEQ2 608 TCTTGGAGATTCTCTGACCGAAGGTGTGGGCGATACAACCTCTCAAGGTGGTTTTGTTCC 

SEQ2 609 TCTTGGAGATTCTCTGACCGAAGGTGTGGGGGATACAACCTCTCAAGGTGGTTTTGTCCC 

SEQ2 601 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 602 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 603 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 6 04 ACTGCTATCAGAATCACTCCATAATCGATACtCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 6 0 5 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 606 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 607 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 608 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 609 ACTGCTATCAGAATCACTCCATAATCGATACTCTTACCAAGTGACTTCTGTTAATTATGG 

SEQ2 601 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 602 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 603 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2604 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 605 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 606 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 6 0 7 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2608 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCIAAATCGA 

SEQ2 609 TGTGTCTGGGAATACTAGTCAACAAATTTTAAAACGTATGACGACAGATCCTCAAATCGA 

SEQ2 601 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 

SEQ2 602 AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
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Table 26: 

SEQ2 60 3 

SEQ2604 
SEQ2605 
SEQ2 60 6 
SBQ2607 
SEQ2608 
SEQ2 60 9 

SEQ2601 
SEQ2 602 
SEQ2603 
SEQ2604 
SEQ2605 
SEQ2606 
SEQ2607 
SEQ2608 
SEQ2609 

SEQ2 601 
SEQ2602 
SEQ2603 
SEQ2604 
SEQ2605 
SEQ2606 
SEQ2 607 
SEQ2 60 8 
SEQ2609 



Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 

AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 

AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 
AAAAGATTTAGAGAAAGCTGATTTATTGACGCTAACTGTTGGTGGTAATGATGTCTTGGC 

TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 
TGTTATTCGTAAAGAGCTCAGTCATTTATCACTAAATTCCTTTGAGAAACCAGCAGAAGC 

ATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATTCTTGC^^AGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATACTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAAC GT T TGAAAGAAAT T C T TG CAAAAGCAAGA CAAGATAAT C CTAAAT TGC C 
ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATCCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 
ATATAAGGAACGTTTGAAAGAAATTCTTGCAAAAGCAAGACAAGATAATCCTAAATTGCC 



SEQ2601 
SEQ2602 
SEQ2603 
SEQ2 604 
SEQ2605 
SEQ2606 
SEQ2607 
SEQ260 8 
SEQ2609 

SEQ2601 
SEQ2602 
SEQ2603 
SEQ2604 
SEQ2605 
SEQ2 60 6 
SEQ2607 
SEQ2 60 8 
SEQ2 60 9 

SEQ2601 
SEQ2602 
SEQ2603 
SEQ2604 
SEQ2 60 5 
SEQ2606 
SEQ2607 
SEQ260 8 
SEQ260 9 

SEQ2601 
SEQ2602 
SEQ2 603 
SEQ26 04 
SEQ2605 
SEQ2 60 6 
SEQ2607 
SEQ2608 
SEQ2609 

SEQ2 601 
SEQ2602 



TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 
TATTTATGTTTTAGGCATTTATAATCCTTTTTACCTAAACTTTCCACAATTAACTAAAAT 

GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 
GCAAACCGTTATTGATAATTGGAATAAAGCTACAAAAGAAGTAGTTGATGCTTCAGAAAA 

TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 
TGTTTATTTTGTCCCAATTAATGACCGCCTTTATAAGGGAATAAATGGTAAAGAGGGTAT 

TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TATAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGT CAT CAAATAGTCAGGCAAGTAT CACTAATGATGCT CTCTT TACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATGACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 
TACAGAGTCATCAAATAGTCAGGCAAGTATCACTAATGATGCTCTCTTTACTGGAGACCA 

TTTTC^TCCC^TAATATTGGCTATC^UUVTC^TGTCTAACGCCGTTATGGAGAAAATAAA 
TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 
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Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 

SEQ2 603 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 6 04 TTTTC^TCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 605 TTTTC^TCCC^TAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 606 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SKQ2607 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 608 TTTTCATCCC^TAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 609 TTTTCATCCCAATAATATTGGCTATCAAATCATGTCTAACGCCGTTATGGAGAAAATAAA 

SEQ2 601 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAG 

SEQ2602 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGGTCC 

SEQ2603 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ2604 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ2 605 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAA 

SEQ2 606 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ2607 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAA 

SEQ2 608 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAAGTGG 

SEQ2 609 TGAAACAAGAAAAAACTGGCCGAACCCAGCTTTCTTGTACAAATABCMARATVSTNCSRA 

SEQ2601 -- 

SEQ2602 

SEQ2603 

SEQ2604 

SEQ2605 

SEQ2606 

SEQ2607 

SEQ2608 

SEQ2 609 NGT SAGAS AC YHYDA S 

>SEQ ID NO 2650:103__090 frame: 2 

I FSLII PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVP 
LLSESLHNRYSYQVTSVNYGVSGNTSQQILI<^MTTDPQIEKDLEKADLLTLWGGNDVIA 
VIRKELSHLSLNSFEKPAEAYKERLKE ILAKARQDNPKLP I YVLGI YNPFYLNPPQLTKM 
QTVIDNWNKATKEVVDASENVYFVPINDRL YKGINGKEGI TESSNSQAS ITNDALFTGDH 
FHPNNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2651:103_H3 6B frame: 2 

IFSLII PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS 
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTV^^ 
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPI YVLGI YNPFYL 
IDNWNKATKEVVDASENVYFVPINDRL YKGINGKEGI I ESSNSQAS ITNDALFTGDHFHP 
NNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2652:103_18RS21 frame: 3 

IFSLII PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS 
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR 
KELSHLSLNSFEKPAEAYKERLKE ILAKARQDNPKLP I YVLGI YNPFYLNFPQLTKMQTV 
IDNWNKATKEWDAS ENVYFVP INDRL YKGINGKEGI TES SNS QAS ITNDAL FTGDHFHP 
NNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2653:103_COH1 frame: 3 

I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPL 
LSESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAV 
IRKELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQ 
TVIDNWNKATKEVVDASENVYFVP INDRL YKGINGKEGITESSNSQASITNDALFTGDHF 
HPNNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2654:103_CJB110 frame: 3 

IFSLII PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS 
ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIR 
KELSHLSLNSFEKPAEAYKERLKEIIiAKARQDNPKLPI YVLGI YNPFYLNFPQLTKMQTV 
I DNWNKATKFAH/DAS ENVYFVP INDRL YKGINGKEGI TES SNS QAS ITNDALFTGDHFHP 
NNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2655:103_1169NT frame: 3 

I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS 
ESLHNRYS YQVTS VNYGVSGNTSQQILKRMTTDPQ IEKDLEKADLLTLTVGGNDVLAVI R 
KELSHLSLNSFEKPAEAYKERLKEILAKARQDNPKLPI YVLGI YNPFYLNFPQLTKMQTV 
IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQAS ITNDALFTGDHFHP 
NNIGYQIMSNAVMEKINETRKNWP 
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Table 26: Comparative Sequences relating to SAG0503 (lipase/acylhydolase) 



>SEQ ID NO 2 656:103_JW9130013 frame: 3 

I FSLI I PKSNPKLTKKDFL TKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS 

ESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLW^^ 

KELSHLSLNS F EKPAEAYKERLKE I LAKARQDNPKL P I YVLGI YNPF YLNF PQLTKMQTV 

IDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHP 

NNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2657sl03_2603 frame: 1 

IFSLI I PKSNPKLTKKDFLTKKVI PLNWALGDSLTEGVGDTTSQGGFVPLL 
SESLHNRYSYQVTSWYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVIiAVI 
RKELSHLS LNS FEKPAEAYKERLKE I LAKARQDNPKLP I YVLGI YNPF YLNF PQLTKMQT 
VIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDAIiFTGDHFH 
PNNIGYQIMSNAVMEKINETRKNWP 

>SEQ ID NO 2658 : 103_M781 frame: 3 

I FS L 1 1 PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTS QGGFVPL 
LSESLHNRYSYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTV^ 
IRKELSHLSLNSFEKPAEAYKERLI^ILAXARQDNPKLPI YVLGI Y^ 
TVIDNWNKATKEVVDASENVYFVPINDRLYKGINGKEGIT^ 
HPNNIGYQIMSNAVMEKINETRKNWP 



SEQ2650 
SEQ2651 
SEQ2652 
SEQ2653 
SEQ2654 
SEQ2655 
SEQ2656 
SEQ2657 
SEQ2658 

SEQ2650 
SEQ2651 
SEQ2 652 
SEQ2653 
SEQ2654 
SEQ2 655 
SEQ2 65 6 
SEQ2657 
SEQ2658 

SEQ2650 
SEQ2651 
SEQ2652 
SEQ2653 
SEQ2 654 
SEQ2655 
SEQ265 6 
SEQ2657 
SEQ2658 

SEQ2650 
SEQ2651 
SEQ2652 
SEQ2653 
SEQ2654 
SEQ2655 
SEQ2656 
SEQ2657 
SEQ2658 

SEQ2650 
SEQ2651 
SEQ2652 
SEQ2653 
SEQ2654 
SEQ2 655 
SEQ2656 
SEQ2657 
SEQ2658 



I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLL SESLHNRY 
IFSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS ESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS ESLHNRY 
I FSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLS ESLHNRY 
IFSLI I PKSNPKLTKKDFLTKKVI PLNYVALGDSLTEGVGDTTSQGGFVPLLSESLHNRY 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEICDLEKADLLTLTVGGNDVIiAVIRKELS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLWGGNDVLAVIRKELSHLS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLWGGNDVLAVIRKELSH^ 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS 

SYQVTSVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVLAVIRKELSHLS 

SYQ\n:SVNYGVSGNTSQQILKRMTTDPQIEKDLEKADLLTLTVGGNDVIAVIRKELSHLS 

LNS FEKPAEAYKERLKE I LAKARQDNPKLP I YVLGI YNPF YIiNFPQLTKMQW 

LNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQTVIDNWNKA 

LNSFEKPAEAYKERLKEILAKAJRQDNPKLPIYVLGIYNPFYLNFPQLTKMQW 

LNS FEKPAEAYKERLKE I LAKARQDNPKL P I YVLGI YNP F YLNFPQLTKMQTVI DNWNKA 

IjNS FEKPAEAYKERLKE ILAKARQDNPKLP I YVLGI YNPF YIiNFPQLTK^ 

LNS FEKPAEAYKERLKE I LAKARQDNPKL P I YVLGI YNPF YLNFPQLTKMQTVI DNWNKA 

LNSFEKPAEAYKERLKEILAKARQDNPKLPI YVLGI YNPFYLNFPQ 

IiNSFEKPAEAYKERLKEIIiAKARQDNPKLPIYVLGIYNPFYLNFPQLTKMQWIDNWNKA 
IjNSFEKPAEAYKERLKEILAKARQDNPKLPIYVLGIYNPFYLNFPQLTK^^ 

TKFiVVDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI 
TKEWDAS ENVYFVP INDRL YKGI NGKEGI I E S SNS QAS I TNDAL FTGDHFHPNNI GYQ I 
TKEWDAS ENVYFVP I NDRL YKGI NGKEGI TESSNSQAS I TNDALF TGDHFHPNNI GYQ I 
TKEWDASENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI 
TKEWDAS ENVYFVP I NDRL YKGI NGKEGI TE S SNSQAS I TNDALF TGDHFHPNNI GYQ I 
TKEWDAS ENVYFVP INDRL YKGI NGKEGI TE S SNSQAS I TNDALFTGDHFHPNNI GYQ I 
TKEWDAS ENVYFVP INDRL YKGINGKEGI TESSNSQAS I TNDALFTGDHFHPNNI GYQ I 
TKEWDAS ENVYFVP I NDRL YKGI NGKEGI TE S SN S QA S I TNDAL F TGDHFHPNNI GYQ I 
TKEWDAS ENVYFVPINDRLYKGINGKEGITESSNSQASITNDALFTGDHFHPNNIGYQI 

MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
MSNAVMEKINETRKNWP 
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Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

SEQ ID NO. 2701: SAG1473 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

GATACAAGTGATAAGAA.TACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CGTCAACTAATCCACCTACAACAGAACCATCGCAAOC^ 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA 
GTGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2702: SAG1473 FROM THE 18RS21 GBS TYPE II STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTGATCAAGTGAACCAGAAACAAATC 
CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATC^TCTTC^AAAGC^ 
ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2703: SAG1473 FROM THE 2603 V/R GBS TYPE V STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGAT CAAGAAGAAGTGGAT CG CGATGAAT CATCAT CTTCAAAAGCAA 
ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2704: SAG1473 FROM THE 090 GBS TYPE la STRAIN 

GACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATCCGTC 
AACTAATCCACCTACAACAGAACCATCGCAAC^ 

AGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAG 
AATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGA 
TGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2705: SAG1473 FROM THE A909 GBS TYPE la STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA 
ACTAGAC(^GTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGC 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGC^ 
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2706: SAG1473 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 

CGTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGTAGA 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 

TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAA 

ATGATGGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 
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Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

SEQ ID NO. 2707: SAG1473 FROM THE C0H1 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAAT 

CCTCAACTAATCCACCTACAAGkGAACCATCGC^ 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCA 
ATGATGAGAAAAAAGGC CACAGTAAGC C TAAAAAGGAA 

SEQ ID NO. 2708: SAG1473 FROM THE H36b GBS TYPE lb STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATC 
CCTCAACTAATCCACCTACAACAGAACGA^ 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 

TAAGAATTTTAGTAAAGCAAGTAGTGATGAAGAArAAGTGGATCGCGATGAATC^ 

ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2709: SAG1473 FROM THE JM910013 GBS TYPE VIII STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATTAGATGA 

ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTC^TCTUVGTGAACCAGAAACAAATC 

CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTGACCTAGTGAAGAGAACAAGCCTGATGGTAGC 

ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 

TAAGAATTTTAGTAAAG CAAGTAGTGAT CAAGAAGAAGTGGAT CG CGATGAATCAT CATCTT CAAAAGCAA 

ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2710: SAG1473 FROM THE M732 GBS TYPE III STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGA 

CCTCAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGGAGC 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATTTTAGTAAAGCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAA 
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

SEQ ID NO. 2711: SAG1473 FROM THE M781 GBS TYPE III STRAIN 

GATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAAAGATCAGATGA 
ACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCAAGTTCATCAAGTGAACCAGAAACAAATC 
CCT.CAACTAATCCACCTACAACAGAACCATCGCAACCCTCACCTAGTGAAGAGAACAAGCCTGATGGGAGC 
ACGAAGACAGAAATTGGCAATAATAAGGATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTAT 
TAAGAATT TTAGTAAAGCAAGTAGTGAT CAAGAAGAAGTGGATCGCGATGAAT CATCAT CT TCAAAAGCAA 
ATGATGAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 



SEQ2 701 ATAC AAGTGATAAGAATACTGAC ACGAGTGTC GTGACTACGAC C T T ATCTGAGGAGAAA 

SEQ2 702 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2703 ATAC AAGTGATAAGAATACTGAC ACGAGTGTCGTGACTACGACC T TATCTGAGGAGAAA 

SEQ2704 

SEQ2 705 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2 706 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2 707 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2709 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2710 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 

SEQ2711 ATACAAGTGATAAGAATACTGACACGAGTGTCGTGACTACGACCTTATCTGAGGAGAAA 
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Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 



SEQ27 01 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ27 07 
SEQ2709 
SEQ2710 
SEQ2711 



GAT C AGATGAACTAGAC C AGTCTAGTACTGGT TC TTC TTCTGAAAATGAATCGAGTTCA 
GAT C AGATGAACTAGAC CAGTC TAGTAC TGGT TC TTC TTCTGAAAATGAATCGAGT TCA 
GATCAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA 

GACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA 

GATTAGATGAACTAGACCAGTCTAGTACTGGTTCTTCTTCTGAAAATGAATCGAGTTCA 
GATC AGATGAACTAGAC C AGTCTAGTACTGGT TCTTCTTCTGAAAATGAATCGAGT TCA 
GATC AGATGAACTAGAC C AGTCTAGTACTGGT TCTTC TTC TGAAAATGAATCAAGTTCA 
G ATTAG ATGAAC TAGAC C AGTCTAGTAC TGGT TCTTCTTC TGAAAATGAATCGAGTT CA 
GAT C AGATGAAC TAGAC C AGTCTAGTAC TGGTT C TTCT TCTGAAAATGAAT CAAGTTCA 
GAT CAGATGAACTAGACCAGTC TAGTAC TGGTTCTT CTTCTGAAAATGAATCAAGT TCA 



SEQ2701 
SEQ2702 
SEQ27 03 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC 
TC AAGTGAACC AGAAACAAAT CCGT CAACTAATCC AC C TACAAC AGAACCAT CGCAAC CC 
TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC 
TCAAGTGAACCAGAAACAAATCCGTCAACTAATCCACCTACAACAGAACCATCGCAACCC 
TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC 
TCAAGTGAAC C AGAAACAAATC CGT C AACTAATC C AC C TACAACAGAAC C ATCGCAAC CC 
TC AAGTGAAC CAGAAACAAATCC CT CAACTAAT C CAC CTAC AAC AGAAC CATCGCAACCC 
TCAAGTGAACCAGAAACAAATC C CT CAACTAATCCACCTACAAC AGAACCAT CGCAAC CC 
TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC 
TCAAGTGAACCAGAAACAAATCCCTCAACTAATCCACCTACAACAGAACCATCGCAACCC 



SEQ2701 
SEQ2702 
SEQ27 03 
SEQ2704 
SEQ27 05 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



TC AC C TAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG 
T C AC CTAGTGAAGAGAAC AAGCC TGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG 
TCAC CTAGTGAAGAGAACAAGC CTGATGGTAGAACGAAGAC AGAAATTGGC AATAATAAG 
TCACCTAGTGAAGAGAACAAGCCTGATGGTAGAACGAAGACAGAAATTGGCAATAATAAG 
TCACCTAGTGAAGAGAAC AAGCC TGATGGTAGCACGAAGACAGAAATTGGCAATAATAAG 
TCAC CTAGTGAAGAGAACAAGC CTGATGGTAGAACGAAGAC AGAAATTGGC AATAATAAG 
TCACCTAGTGAAGAGAACAAGCCTGATGGGAGCACGAAGACAGAAATTGGCAATAATAAG 
T C A C C T AG TG AAG AG AA C AAG C C T GAT GGT AG C AC G AAG A C AGAAAT T GG C AATAATAAG 

TCACCTAGTGAAGAGAACAAGCCTGATGGGAGCACGAAGACAGAAATTGGCAATAATAAG 
TCAC CTAGTGAAGAGAACAAGC C TG ATGGGAG C ACGAAGACAGAAATTGGCAATAATAAG 



SEQ2701 
SEQ27 02 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA. 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 
GATATT T CTAGTGGAACAAAAGTATTAATTTC AGAAGATAGTATTAAGAATTTTAGTAAA 
GATATTTCTAGTGGAACAAAAGTATTAATTTCAGAAGATAGTATTAAGAATTTTAGTAAA 



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



GC AAGT AGTGATC AAGAAGAAGTGGATC GCGATGAAT C ATCATC TTC AAAAGC AAGTGAT 
G C AAGT AG TG AT C AAGAAG AAG TGGAT C G C G ATG AAT CAT CAT C T T C AAAAG C AAATG AT 
G C AAGT AG T GAT C AAGAAGAAGTG GAT C G C GAT G AAT CAT CAT C T T C AAAAGC AAATG AT 
G C AAGT AGTGAT CAAGAAGAAGTGGATCGCGATGAAT CAT CATC TTCAAAAGCAAATGAT 
G C AAGTAGTGAT CAAGAAGAAGTGGATCGCGATGAATC ATCATC TT CAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGAACGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
GC AAGTAGTGATCAAGAAGAAGTGGAACGCGATGAAT C ATC AT CT T CAAAAGCAAATGAT 

GCAAGTAGTGATCAAGAAGAAGTGGATCGCGATGAATCATCATCTTCAAAAGCAAATGAT 
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SEQ27 01 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ27 07 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ27 04 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 

SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGG AAAAAAGG C C AC AGT AAG C C TAAAAAGGAA 

GGGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GGGAAAAAAGG CCACAGTAAGC CTAAAAAGGAA 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAASGATACAAGTGATAAGAATACTGACAC 

GAGAAAAAAGGCCACAGTAAGCCTAAAAAGGAA 

GAGAAAAAAGGCCAC AGTAAGC C TAAAAAGGAA 

GAGAAAAAAGGC CACAGTAAGC CT AAAAAGGAATAB CMARAT VS TNC S RATNGT S AGCWA 

AGTGTCGTGAC TACGAC CTTAT CTGAGGAGAAAAGATTAGATGAACTAGAC CAGTC TAG 
TRACANCHRAMYRTN 

ACTGGTTCTTCTTCTGAAAATGAATCGAGTTCATCAAGTGAACCAGAAACAAATCCCTC 
ACTAATC C AC CTACAAC AGAACC ATCGC AACC CTCAC CTAGTGAAGAGAACAAGCC TGA 
GGTAGC ACGAAGAC AGAAATTGGCAAT AATAAGGATATT TCTAGTGGAAC AAAAGT AT T 
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SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ27 05 
SEQ27 06 
SEQ27 07 
SEQ27 09 
SEQ2710 
SEQ2711 



Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 



ATTTCAGAAGATAGTATTAAGAATTTTAGTAAAGCAAGTAGTGATCAA.GAARAAGTGGA 



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



CGCGATGAATCATC ATC TTC AAAAGC AAATGATGAGAAAAAAGGCCACAGTAAGC C TAA 



SEQ2701 
SEQ2702 
SEQ2703 
SEQ2704 
SEQ2705 
SEQ2706 
SEQ2707 
SEQ2709 
SEQ2710 
SEQ2711 



AAGGAA 



>SEQ ID NO 2750:4_1169NT frame: 1 

DTSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKASD 
GKKGHS KPKKE 

>SEQ ID NO 2751:4_18RS21 frame: 1 

DTSDKNTDTSWTTTIiSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND 
GKKGHS KPKKE 

>SEQ ID NO 2752:4_2603 frame: 1 

DTSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND 
GKKGHS KPKKE 

>SEQ ID NO 2753:4_090 frame: 1 

DQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQPSPSEENKPDGRTKTEIGNNKDISSG 
TKVLISEDSIKNFSKASSDQEEVDRDESSSSKANDGKKGHSKPKKE 

>SEQ ID NO 2754:4_A909 frame: 1 

DTSDKNTDTSWTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGSTKTEIGlWKDISSGTKOTiISEDSIKNFSKASSDQEEVDRDESSSSKAND 
EKKGHS KPKKE 

>SEQ ID NO 2755:4_CJB110 frame: 1 

DTSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND 
GKKGHS KPKKE 
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Table 27: Comparative Sequences relating to SAG1473 
(cell wall surface anchor family protein) 

>SEQ ID NO 2756:4_COHl frames 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGSTKTEIGNNKDI S SGTKVLI SEDS I KNFSKASSDQEEVERDESSSS KAND 
EKKGHS KPKKE 

>SEQ ID NO 2757:4JK36B frame: 1 

DTSDKNTDTSWTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPSEENKPDGSTKTEIGNNKDI S SGTKVLI SEDS IKNFSKASSDQEXVDRDESSS S KAND 
EKKGHSKPKKE 

>SEQ ID NO 2758 :4_JM913 00X3 frame: 1 

DTSDKNTDTSVVTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPS EENKPDGSTKTEI GNNKDI S SGTKVLI SEDSI KNFSKASSDQEEVDRDESS S SKAND 
EKKGHSKPKKE 

* 

>SEQ ID NO 2759:4_M732 frame: 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPS EENKPDGS TKTE I GNNKDI S SGTKVLI S EDS I KNF S KASSDQEEVERDE SS S S KAND 
EKKGHSKPKKE 

>SEQ ID NO 2760:4_M7 81 frame: 1 

DTSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 
SPS EENKPDGSTKTEIGNNKDI SSGTKVLI SEDS IKNFSKASSDQEEVDRDESS SSKAND 
EKKGHSKPKKE 



SEQ275 0 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2751 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ27 52 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2753 DQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2754 TSDKNTDTSWTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2755 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2756 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2757 TSDKNTDTSWTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2758 TSDKNTDTSWTTTLSEEKRLDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2759 TSDKNTDTSVVTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ27 6 0 TSDKNTDTSWTTTLSEEKRSDELDQSSTGSSSENESSSSSEPETNPSTNPPTTEPSQP 

SEQ2750 S PS EENKPDGRTKTEI GNNKDI SSGTKVLI SEDS I KNFSKASSDQEEVDRDESSSSKASD 

SEQ2751 SPS EENKPDGRTKTEIGNNKDI SSGTKVLI SEDSI KNFSKASSDQEEVDRDESSSSKAND 

SEQ2752 SPSEENKPDGRTKTEIGNNKDISSGTKVLI SEDS I KNFSKASSDQEEVDRDESSSSKAND 

SEQ2753 SPSEENKPDGRTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND 

SEQ2754 SPSEENKPDGSTKTEIGNNKDI SSGTKVLI SEDS I KNFSKASSDQEEVDRDESSSSKAND 

SEQ2755 SPSEENKPDGRTKTEI GNNKDI SSGTKVLI SEDS IKNFSKASSDQEEVDRDESSSS KAND 

SEQ2756 SPSEENKPDGSTKTEIGNNKDI SSGTKVLI SEDS I KNFSKASSDQEEVERDESSSS KAND 

SEQ2757 SPSEENKPDGSTKTEIGNNKDI SSGTKVLI SEDSI KNFSKASSDQEXVDRDESS SSKAND 

SEQ2758 SPSEENKPDGSTKTEIGNNKDISSGTKVLI SEDS I KNFSKASSDQEEVDRDESSSSKAND 

SEQ2759 SPSEENKPDGSTKTEIGNNKDISSGTKVLI SEDS IKNFSKASSDQEEVERDESS SSKAND 

SEQ27 6 0 SPSEENKPDGSTKTEIGNNKDISSGTKVLISEDSIKNFSKASSDQEEVDRDESSSSKAND 

SEQ2750 KKGHSKPKKE 

SEQ2751 KKGHSKPKKE 

SEQ2752 KKGHSKPKKE 

SEQ2753 KKGHSKPKKE 

SEQ2754 KKGHSKPKKE 

SEQ2755 KKGHSKPKKE 

SEQ2756 KKGHSKPKKE 

SEQ2757 KKGHSKPKKE 

SEQ2758 KKGHSKPKKE 

SEQ2759 KKGHSKPKKE 

SEQ2760 KKGHSKPKKE 
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Table 28: Comparative Sequences relating to SAG1552 

(conserved hypothetical protein) 

SEQ ID NO. 2801: SAG1552 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

TTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCT 
TCCTTAGCAGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAGTGGTTCCATTTAATTTCCAAC 
ATGGGGGCAAATACTGTAAGAGTCAAAGTACCGATGAATGTTGCATTTTACGATGCTTTATATCACCACAACAAAGCA 
TCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAAT 
GATAATTATAGGGGGT ATTTAAAACGAGAAGCAAAAGGCGT TGTGGATAT TCTC C ATGGGC GTAAGCAAGTATGGAAT 
ACTGATTTTGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAAT 
AGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGCGGCA 
GCTAATC CATTTGAGGT C ATGC TAGC T CAAGTTATGGATGAATTGACAC ATTATGAGACAGCTAAATATGGTTGGCAA 
CATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCGTTATCGAAAACCATTTGAGGCACAGGCTCCTAAA 
TACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCAGGTATTTTTGCAGCATATAAAGCTATT 
GATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAA 
GAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGC 
TATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAG 

CGTTTACTAGAAGATTATGAAT C TTTTATATC ATC CGGTAGTTTTGGAGCGACTATC AATGC ATGGCAAGACGATTGG 
AATGCAAGGGCGTGGAATACATCCTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAAT 
CAAGGTTATGGT T TAT T AGGC TTT AAAAACGCAAAAC ATC ATTATCAAGTTGATGGTAAAAGAGGC AAAGGAGAGTGG 
AAACATCCTCTG 

SEQ ID NO. 2802: SAG1552 FROM THE 

ATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCT 

GAAAAACTAAAAGAAAAACGATTATT AC CAAT AGATATTAC ACCAAAAT C TGGTAGTAGAAAAATGAATGGT AGTAAG 
GTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGC 

TAT AATGC CTTAAAAGCGAACTAT CTTCGAC AGCTT AACGGTAAAGATTTTT ATGCT TTC CC AC C AAAGAAGAAC AGT 
AGTAATTTTGAGCAGATCAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAG 
AGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAATTGATAGGCACCAAAAAACATTTGATTCACAA 
ACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCT 
CAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGT 
GCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACC 

TTTTTAAAAGAC T CCTATT ATAGT ATTTAAGAAAGAA 

SEQ ID NO. 2803: SAG1552 FROM THE 18RS21 GBS TYPE II STRAIN 

AAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAA 
CCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAA 
ACGTATCGTGAATGGTTC C ATTT AATTTCC AACATGGGGGCAAATAC TGTAAGAGT C AAGGTAC CGATGAATGTTGC A 
TTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCT 
TATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
GATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGG 
GTACTTGGTTATGT CGTAGGGGATGATTGGAATAGTGGTACTGTCGC TTATACTAATC AT CAAGAGAAAAAAACGCAA 
TATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTG 
ACACATTATGAGACAGCTAAATATGGTTGGC AACATTTGATTAGTTTT TC AAAC TCACCAAC AACAGAC C CTTTTC AT 
TATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTT 
AAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAG 

AATATC AGTAAAGAAGATAGAC AAAAGATTAAAGAAC TTT CTTTGT C AC AGGGATACGTTAAAC TGCTAAATGC TT AT 
CACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGT 
CCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGT 
CAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTAT 
CAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCT 
AGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATA 
GATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTG 
TC TATTGATC CAAATGGCAAGTCTGAATT ATTTGTC CAAGAGCGCTATAATGCCTTAAAAGCGAAC TAT CTT CGAC AG 
C TTAACGGTAAAG ATTTTTATGC TTTCC C AC CAAAGAAGAAC AGTAGT AATTTTGAGC AGATAAATATGGT AT TGAGA 
AATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTC 

AAAACAGGAACAACTGATAGGCAC C AAAAAAC ATTTGATTC AC AAACAGATATTTCGTTT GGAAAGGAC T T TAT AGAG 
Gp?CAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTAT 
GGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATG 
GCAGATTATCGTTTGAAAAATTGGGAGAGAC C C GAT AC C AAAAC CTTT TT AAAAG ACTC C TATT ATGTATTAAGAAAG 
AA 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 

SSQ ID NO. 2804: SAG1552 FROM THE 2603 V/R GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

TATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTG 
TTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATC 
GTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACG 
ATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCA 
ATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTC 
TCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGGGTACTTG 
GTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAG 
GACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATT 
ATGAGAC AGCTAAATATGGTTGGC AACATTTGATTAGTTTTTCAAACT C AC CAACAACAGACCCTTTT C ATTATCGAA 
AAC CATTTGAGG CACAGGCT C C TAAATACGTAC AACTAAATGTAGAAAATATTC AAGCTAATTC AAATGTTAAAGCAG 
GTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCA 
GTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAA 
TCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGC 
CGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGA 
CTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGTCAATTCC 
TATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTG 
ATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTG 
ATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTA 
CACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTG 
ATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACG 
GTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAA 
AGATTGTTGAAGACATGGAAAAAGTAAAAGCAAC AGAGAGGTTC TTAC C AACTC ATCCTAC TGGT CTTC TC AAAACAG 
GAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAA 
TTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTATGGTGTGA 
AGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATT 
ATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGGT 
CTAAAGAAAGAGAGAGAACATATGGTCCA 

SEQ ID NO. 2805s SAG1552 FROM THE A909 GBS TYPE la STRAIN 
(REVERSE COMPLEMENT) 

AAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAA 
CCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAA 
ACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCA 
, TTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCT 
TATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 
GATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCATTATGATCTTAGTCCTTGG 
GTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAA 
TATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTG 
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCAT 
TATCGAAAAC CATTTGAGGCAC AGGCT CCTAAATACGTAC AAC TAAATGTAGAAAATATTC AAGCTAATTCAAATGTT 
AAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAG 
AATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTAT 
CACAAAATCCCTGTTCTAGTCACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGT 
C CT CTGCCGATTAATGAAAAAGAAC AAGGTC AGCGTTTACTAGAAGATTATGAATCTTTT ATATC ATC CGGT AGTTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGT 
CAATTCCTATGGGGQGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTAT 
CAAGTTGATGGT AAAAGAGG C AAAGGAGAGTGGAAACATC CTCTGATGACTAGTGC AACAGGAGATG AC TTATATGCT 
AGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATA 
GATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTG 
TCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAG 
C TTAACGGTAAAGAT TTTTATGCTTTC C CAC CAAAGAAGAAC AGTAGTAATTTTGAGCAG ATAAATATGGTATTGAGA 
AATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTC 
AAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAG 
GTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAGAATTCACGATGATTACTTTAAACATTAT 
GGTGTGAAGGAGTTAGAAAATTGAGAGCCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGA 
TGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGA 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 

SEQ ID NO. 2806: SAG1552 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

TATTACTTTGATGGTAGTTTGTATTTACCAAAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGT 
GATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTAT 
CATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACT 
GTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACC^C^CAAAGCATC^AAGAGGCCACTG 
TATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGG 
TATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAATACAGATTTTGGTAGC 
CGTCATTATCATTATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCT 
TATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAG 
GTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTT 
TCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAAT 
GTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGA 
TACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCA 
C AGGGATACGTTAAACTGCTAAATGCTTATC ACAAAAT C C C TGTTC TAGTCACGGGTTATGGC TATTCGACAGCGAGA 
GGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAAAGAACAAGGTCAGCGTTTACTAGAAGAT 
TATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGG 
AATACAT CTTTC GCC ACAAATAAAC ATAATCAATTC CTATGGGGGGATGC AC AAGTATTTAATCAAGGTTATGGTTTA 
TTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATG 
ACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAA 
AAACTAAAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTC 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTAT 
AATGCCTTAAAAGCGAACTATCTT CGAC AGCTTAACGGTAAAGATTTTTATGCTTT C C CACCAAAGAAGAAC AGTAGT 
AATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGG 
TTCTTAC C AAC TC AT CC TACTGGTC TTC TCAAAACAGGAAC AAC TGATAGGC AC CAAAAAACATTTGATTCACAAAC A 
GATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAA 
AAAATTCACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCT 
AATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTT 
TTAAAAGACTC C TATTATGTATTAAGAAAGA 

SEQ ID NO. 2807: SAG1552 FROM THE COH1 GBS TYPE III STRAIN 

TTTACCACAGGGCTTATTAAAAGAAAATACAAGAACTAACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCAC 
CAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTAC 
T C AAAAAACGTATCGTGAATGGTTC CATTTAATTTC CAAC ATGGGGGC AAATAC TGTAAGAGTCAAGGTAC CGATGAA 
TGTTGCATTTTACGATGCCTTATATCACCACAACAAAGAATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTAT 
AGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGG 
CGTTGTGGATATTCTCCATGGGCGTAAGC AAGTATGGAATACTGATTTTGGTAGC CGTCATTATCAT TATGATC T TAG 
TCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAA 
AACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGA 
TGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCC 
TT T T C ATT AT CG AAAAC C AT TTGAGG C AC AGGC T C C T AAATAC GT AC AAC TAAATGTAGAAAAT AT T C AAG C TAAT T C 
AAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGA 
TAAAGAGAATAT C AGTAAAGAAGATAGAC AAAAGAT TAAAGAACTTTCTTTGTC AC AGGGATACGTTAAAC TGCTAAA 
TGCTTATCACAAAATCC CTGTTCTAGTCACGGGTTATGGC TATT CGAC AGCGAGAGGTAT TGCCC AAAAAGAAATTGA 
T AAACGT CC TCT GCCGATTAATGAAAAAGAAC AAGGTC AGC GTTTACTAGAAGATTATGAATC TTTTATATC ATCCGG 
TAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAA 
ACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACA 
TCATTATCAAGTTGATGGTAAAAGAGGCAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTT 
ATATGCTAGC AGTGATGAAAGC TATCTCTACC TTGCGATTAAAACAAAAC CTGAAAAAC TAAAAGAAAAACGATTATT 
ACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTT 
TGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCT 
TCGACAGCTTAACGGTAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
ATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGG 
TCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAACCAGATATTTCGTTTGGAAAGGACXT 
TATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAA 
ACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGAT 
AAAGATGG CAGAT TAT CGTTTGAAAAATTGGGAGAGAC C CGATACC AAAACCTTTTTAAAAGACT 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 

SEQ ID NO. 2808: SAG1552 FROM THE H36b GBS TYPE lb STRAIN 

AAGGGGCTTAT TAAAAGAAAATAC AAGAAC TAACTTTGTTGTTAAAGGTGATACTGTACTTC ACAAGC CCAC C AATAA 
ACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAA 
AACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGC 
ATTTTACGATGCCTTATATCACCACAACAAAGCATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTC 
TTATCGCAATAATGCTTCTATAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGT 
GGATATT CT CC ATGGGCGTAAGCAAGTATGGAATAC TGATT TTGGTAGCAGT CATTATC ATTATGATCTTAGTCCTTG 
GGTACTTGGTTATGTCGTAGGGGATGATGGACATAGTGGTACTGTCGCTTTATACTAATCATCAAGAGGAGAAAAACG 
CAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAA 
TTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTT 
CATTATCGAAAACCATTTGAGGCACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAAT 
GTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAA 
GAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCT 
TATC ACAAAATC CCTGTT CTAGTC ACGGGTTATGGCTAC T CGACAGCGAGAGGTATTGCC CAAAAAGAAATTGATAAA 
CGTC CT CTGCCGATTAATGAAAAAGAAC AAGGTC AGCGTTTACTAGAAGATTATGAATC TTTTATATCATC C GGTAGT 
TTTGGAGCGACTATC AATGC ATGG C AAGACGATTGGAATGC AAGGGTGTGGAATACATC C TTCGC CACAAATAAACAT 
AGTCAATTC CTATGGGGGGATGC ACAAGTATT TAAT C AAGGTTATGGTTTATTAGG CTTTAAAAACGCAAAACAT CAT 
TATCAGGTTGATGGTAAAAGAGGCAAAGAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATAT 
GCTAGC AGTGATGAAAGCTATC TC TAG C TTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACC A 
ATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTA 
TTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGA 
CAGC TTAATGGT AAAGATTTTTATGCTTTC CC AC CAAAGAAGAAC AGTAGTAATTTTGAGC AGATAAATATGGTATTG 
AGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTT 
CTCAAAACAGGAACAACTGATAGGCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATA 
GAGGTC AGAAT T C CGTGGC AGTTGTTGAATTTTT C TGAT C CATC AT CT CAAAAAATTC ACGATGATTACTTTAAACAT 
TATGGTGTGAAGGAGTTAGAAATTGAGAGC ATTGC TTTAGGATTAGGTGC TAATAGC AAAGAAAACACAC TGATAAAG 
ATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGT 

SEQ ID NO. 2809: SAG1552 FROM THE JM9130013 GBS TYPE VIII STRAIN 

ACTTTGT TGTTAAAGGTGATACTGTACTTCAC AAGC C C AC CAATAAAC CTTTTGTTGTTAAAGGAGTAGACGTTGAGT 
CTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCA 
ACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAG 
CATCAAAGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTATAACAGCTTTTA 
ATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGA 
AT ACTGATTTTGGTAGC AGT C ATTATCATTATGAT C TTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGA 
ATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGG 
CAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGC 
AACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGCACAGGCTCCTA 
AATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTA 
TTGATTTC CATC CTCGATAC AAGGATTATC TATTAT TTGATAAAGAGAATATC AGTAAAGAAGATAGAC AAAAGATTA 
AAGAACTTTCTTTGTCACAGGGATACGTTAAACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATG 
GCTACT CGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTC CTCTGC CGATTAATGAAAAAGAAC AAGGTC 
AGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATT 
GGAATGC AAGGGTGTGGAATACAT C CTTCGC CAC AAATAAACATAGTC AATT C CTATGGGGGGATGC AC AAGTATTTA 
ATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAGGTTGATGGTAAAAGAGGCAAAGAAGAGT 
GGAAAC ATC CTCTGATGACTAGTG C AAC AGGAGATGACTTATATGC TAGC AGTGATGAAAGCTAT CTC TAC CTTGCGA 
TTAAAACAAAAC CTGAAAAAC T AAAAGAAAAACGAT TATT ACC AATAGAT ATTACAC CAAAATCTGGTAGTAGAAAAA 
TGAATGGTAGTAAGGT C AC ATTTT CTAAATC TAGTGACTTTGTATTGTCTATTGATC C AAATGGC AAGTCTGAATTAT 
TTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAAAGATTTTTATGCTTTCCCAC 
CAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAG 
TAAAAGCAACAGAGAGGTTCTTACCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAA 
CATTTGATT CAC AAAC AGATATTT CGTTTGGAAAGGACTTT AT AGAGGTC AGAATT C CGTGGC AGTTGTTGAATTTTT 
CTGATC C AT CAT CTC AAAAAATTC ACGATGATTACTTTAAAC ATTATGGTGTGAAGGAGTT AGAAATTGAGAGC ATTG 
CTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGAC 
C CGAT AC C AAAAC C TTT TT AAAAG ACT C C TATT AT AGT AT T AAGAAAG 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 

SEQ ID NO. 2810: SAG1552 FROM THE M732 GBS TYPE III STRAIN 

TAC AAGAAC TAAC TTTGTTGTTAAAGGTGATACTGT ACT TCACAAGC C CAC CAATAAACC TTTTGTTGTTAAAGGAGT 
AGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCA 
TTTAATTTCCAACATGGGGGCAAATACTGTAAGAGTCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCA 
CCACAACAAAGAATCA^AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCTAT 
AACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAA 
GC AAGTATGGAATACTGATTTTGGTAGCCGTCATTATC ATTATGAT CTTAGTC CTTGGGT AC TTGGTTATGT CGTAGG 
GGATGATTGCAATAGTGGTACTGTCGCTTATACTAATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAA 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAA 
ATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCAACAACAGACCCTTTTCATTATCGAAAACCATTTGAGGC 
ACAGGCTCCTAAATACGTACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGC 
ATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAG 
AC AAAAG AT T AAAGAAC T T T C T T T GT C AC AGGG AT AC GT TAAAC T G C T AAAT G C T T ATC AC AAAAT C C C TGT T C TAGT 
CACGGGTTATGGCTATTCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGAAAA 
AGAACAAGqTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATG 
GCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGC 
ACAAGTATTTAATCAAGGTTATGGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGG 
CAAAGGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCT 
CTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTATTACCAATAGATATTAC^CCAAAATCTGG 
TAGTAGAAAAATGAATGGTAGTAAGGTCACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAA' 
GTCTGAATTATTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAAAGATTTTTA 
TGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGA 
CATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTAC C AAC T CATCCTAC TGGT CTTCTC AAAAC AGGAACAACTGATAG 
GCACCAAAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTT 
GTTGAATTTTTCTGAT C CAT CATC TC AAAAAATT CACGATGATTACTTTAAAC ATTATGGTGTGAAGGAGTT AGAAAT 
TGAGAGCATTGCTTTAGGATTAGGTGCTAATAGCAAAGAAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAA 
TTGGGAGAGACCCGATACCAAAACCTTTTTAAAAGACTCCTATTATAGTATTAAG 

SEQ ID NO. 2811: SAG1552 FROM THE M781 GBS TYPE III STRAIN 

TTTGATGGTAGT T TGTATTTAC C ACAGGGCTTATTAAAAGAAAATAC AAGAACTAACTTTGTTGTTAAAGGTGATACT 
GTACTTCACAAGCCCACCAATAAACCTTTTGTTGTTAAAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCAC 
AACGATTTTCCTATTACTCAAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 
GTC AAGGTACCG ATGAATGTTGC ATTTTACGATGC C TTATAT CAC C AC AAC AAAGAATCAAAGAGGCCACTGTATTTG 
TTGC AAGGAATACGTATAGATTC T TAT CGCAATAATGC TTCT ATAACAGCTT TTAATGAT AATTATAGGGGGTATTTA 
AAACGAGAAGCAAAAGGCGTTGTGGATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCAT 
TATC ATTATGATC TTAGTCCTTGGGTAC T TGGTTATGTCGTAGGGGATGATTGGAATAGTGGTACTGTCGC T TATACT 
AATCATCAAGAGAAAAAAACGCAATATAAAGGACGTTATTTTAAAACTTCTGTGGCAGCTAATCCATTTGAGGTCATG 
CTAGCTCAAGTAATGGATGAATTGACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAAC 
TC AC CAACAAC AGACC CTTTTC ATTATCGAAAAC C ATTTGAGGC AC AGGC TCCTAAATACGTACAACTAAATGTAGAA 
AATATTCAAGCTAATTCAAATGTTAAAGCAGGTATGTTTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAG 
GATTATCTATTATTTGATAAAGAGAATATCAGTAAAGAAGATAGACAAAAGATTAAAGAACTTTCTTTGTCACAGGGA 
T ACGTTAAAC TGCTAAATG C TTAT C ACAAAATC CCTGTTCTAGT CACGGGTTATGGCTAT TCGACAGCGAGAGGTATT 
GC CCAAAAAGAAATTGATAAACGT C CTCTGC CGATT AATGAAAAAGAACAAGGTCAGCGTT TACTAGAAGATTATGAA 
TCTTTTATATCATCCGGTAGTTTTGGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACA 
TCTTTCGCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTATGGTTTATTAGGC 
TTT AAAAACGCAAAACATC ATTAT C AAGT TGATGGTAAAAGAGGCAAAGGAGAGTGGAAAC ATC CT CTGATGACTAGT 
GCAACAGGAGATGACTTATATGCTAGCAGTGATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTA 
AAAGAAAAACGATTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGTCACATTT 
TCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATTATTTGTCCAAGAGCGCTATAATGCC 
TTAAAAGCGAACTATC TTCGACAG C TTAACGGTAAAGATTT TTATGC TTT C CC ACC AAAG AAGAACAGTAGTAATTTT 
GAGCAGATAAATATGGTATTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTTA 
CCAAC TC AT C CT AC TGGTC TTCTC AAAACAGGAAC AACTGAT AGGC AC C AAAAAAC ATT TGATTC ACAAAC AGATATT 
TCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCAGTTGTTGAATTTTTCTGATCCATCATCTCAAAAAATT 
CACGATGATTACTTTAAACATTATGGTGTGAAGGAGTTAGAAATTGAGAGCATTGCTTTAGGATTAGGTGCTAATAGC 
AAAGAAAAC AC ACTGATAAAGATGGC AGATTATCGTTTGAAAAATTGGGAGAGAC CCGAT AC CAAAACCTTT TTAAAA 
GACTC CTATTATAGTATTAAGAAAGAATGG 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 



SEQ2801 

SEQ2802 

SEQ2803 AAGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2 804 TATTAAAAGAAAATACAAGAACT 

SEQ2 805 AAGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2 806 ATTACTTTGATGGTAGTTTGTATTTACCAAAGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2807 TTTACCACAGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2 808 AAGGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2809 

SEQ2810 TACAAGAACT 

SEQ2 811 TTTGATGGTAGTTTGTATTTACCACAGGGCTTATTAAAAGAAAATACAAGAACT 

SEQ2 801 - -TTTGTTGTTAAAGGTGATACTGTACTTC^CAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2802 

SEQ2 803 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2804 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 805 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 806 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 807 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 808 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 809 ACTTTGTTGTTAAAGGTGATACTGTACTTC^CAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 810 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2811 ACTTTGTTGTTAAAGGTGATACTGTACTTCACAAGCCCACCAATAAACCTTTTGTTGTT 

SEQ2 801 AAGGAGTAGACGTTGAGTCTTCCTTAGCAGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2802 

SEQ2 803 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2 804 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2 805 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2 806 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2 807 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2808 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2809 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2810 AAGGAGTAGACGTTGAGTCTTCCTTAGCGGGTTATCATCACAACGATTTTCCTATTACT 

SEQ2811 AAGGAGTAGACGTTGAGTCTTC CTTAGCGGGTTAT CATCACAACGATTTTCCTATTACT 

SEQ2801 AAAAAACGTATCGTGAGTGGTTCCATTTAATTTCC^iTATGGGGGC^AATACTGTAAGA 

SEQ2802 

SEQ2 803 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCC^CATGGGGGCAAATACTGTAAGA 

SEQ2 804 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 805 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 806 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 807 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 808 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 809 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2810 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2811 AAAAAACGTATCGTGAATGGTTCCATTTAATTTCCAACATGGGGGCAAATACTGTAAGA 

SEQ2 801 TCAAAGTACCGATGAATGTTGCATTTTACGATGCTTTATATCACCACAACAAAGCATCA 

SEQ2802 

SEQ2 803 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGCATCA 

SEQ2 804 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGCATCA 

SEQ2 805 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATC^ 

SEQ2 806 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGCATCA 

SEQ2 807 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAAC^ 

SEQ2 8 0 8 TCAAGGTACCGATGAATGTTGC^TTTTACGATGCCTTATATCACCACAACAAAGC^TCA 

SEQ2 809 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGC^TCA 

SEQ2 810 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATC!ACC^ 

SEQ2811 TCAAGGTACCGATGAATGTTGCATTTTACGATGCCTTATATCACCACAACAAAGAATCA 

SEQ2 801 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2802 

SEQ2 803 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 8 0 4 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 805 AGAGGCGACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2806 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 807 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 808 AGAGGC CACTGTATTTGTTGCAAGGAATACGTATAGAT TCTTAT CGCAATAATG CTTCT 
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SEQ2 809 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2810 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 8 11 AGAGGCCACTGTATTTGTTGCAAGGAATACGTATAGATTCTTATCGCAATAATGCTTCT 

SEQ2 801 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2802 

SEQ28 0 3 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 804 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 805 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ28 0 6 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 807 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 808 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 809 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 810 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2 8 11 TAACAGCTTTTAATGATAATTATAGGGGGTATTTAAAACGAGAAGCAAAAGGCGTTGTG 

SEQ2801 - ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCAT 

SEQ2802 ATGACTA - GT G CAA CAGGAGATGAC T TAT AT - GCTAGCAGTGATGAAAGC 

SEQ2 803 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 

SEQ2 804 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 

SEQ2 805 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTGGGTAGCCGTCATTATCAT 

SEQ2 806 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACAGATTTTGGTAGCCGTCATTATCAT 

SEQ2807 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCCGTCATTATCAT 

SEQ2 808 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCAGTCATTATCAT 

SEQ2 809 ATATTCTCCATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAGCAGTCATTATCAT 

SEQ2 810 ATATTCTC CATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAG CCGTCATTATCAT 

SEQ2 811 ATATTCTC CATGGGCGTAAGCAAGTATGGAATACTGATTTTGGTAG CCGTCATTATCAT 

SEQ2 801 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 

SEQ2 802 TAT — CTCTA- - CCTTGCG- ATTAAAACAAAACCTGAAAAACTAAAAGAAAAACGATTAT 

SEQ2803 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT - AC 

SEQ2 804 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT- AC 

SEQ2805 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 

SEQ2 806 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT - AC 

SEQ2807 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT - AC 

SEQ2 808 TATGATCTTAGTCCTTGGGTACTTGGTTATGT CGTAGGGGATGATGGACATAGTGGT - AC 

SEQ2809 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 

SEQ2 81 0 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGCAATAGTGGT-AC 

SEQ2811 TATGATCTTAGTCCTTGGGTACTTGGTTATGTCGTAGGGGATGATTGGAATAGTGGT-AC 



SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2 805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2 80 8 
SEQ2809 
SEQ2 810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 



TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TACCAATAGATATTA- - CAC CAAAAT C TGGTAGTAGAAAAAT GAATGGTAGTAAGGT CA C 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA - AAAAACGCAATATAAAGGAC- GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA - AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTTTATACTAATCATCAAGAGGAGAAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT -ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC -GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 
TGTCGCTT - ATACTAATCATCAAGAGAA- AAAAACGCAATATAAAGGAC - GTTATTTTAA 

AACTTCTGCGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTTATGGAT - -GAATTG 
ATTTTCTAAATCTAGTGA- CTTTGTATTGTC- TATTGATCCAAATGGCAAGTCTGAATT- 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - -GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT- -GAATTG 
AACTTCTGTGGCAGCTAATCCATTTGAGGTCATGCTAGCTCAAGTAATGGAT - - GAATTG 

ACAC^TTATGAGAC^GCTAAATATGGTTGGCAAC^TTTGATTAGTTTTTCAAACTCACCA 
ATTTGTC- CAAGAGCGCTATA-ATGCCT - - TAAAAGCGAACTATCTTCGACAGCTTAACG 
ACACATTATGAGAC^GCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 
ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 
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SEQ2 805 ACAC^TTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 



SEQ2 806 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 807 ACACATTATGAGACAGCTAAATATGGTTGGC^C^TTTGATTAGTTTTTC^U^CTCACCA 

SEQ2 808 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 809 AC^CATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2810 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 811 ACACATTATGAGACAGCTAAATATGGTTGGCAACATTTGATTAGTTTTTCAAACTCACCA 

SEQ2 801 CAACAGAC CCTTTTCGTTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 802 TAAAGATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATCAATA 

SEQ2803 CAACAGAC CCTTTTCATTATCGAAAACCATTTG -AGGCACAGGCTCCTAAATA 

SEQ2 804 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 805 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 806 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 807 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 808 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 809 CAACAGAC CCTTTTCATTATCGAAAACCATTTG-AGGCACAGGCTCCTAAATA 

SEQ2 810 CAACAGAC CCTTTTCATTATCGAAAACCATTTG - AGGCACAGGCTCCTAAATA 

SEQ2811 CAACAGAC CCTTTTCATTATCGAAAACCATTTG- AGGCACAGGCTCCTAAATA 

SEQ2 801 G- TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATTT 

SEQ2 802 GGTATTGAaAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAaAGAGGT 

SEQ2 803 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 804 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 805 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 806 G-TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 807 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 808 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATGT 

SEQ2 809 G - TACAACTAAATGTAGAAAATATTCAAGCTAATTCGAATGTTAAAGCA GGTATGT 

SEQ2 810 G- TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 811 G- TACAACTAAATGTAGAAAATATTCAAGCTAATTCAAATGTTAAAGCA GGTATGT 

SEQ2 801 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2802 TCTTACCAACTCATCCTACTGGTCTT CTCAAAACAGGAACAATTGAT -AGGCACCA 

SEQ2 803 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2804 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 805 TTGCAGCIATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 806 T T G CAG CA TATAAAG C TAT TGAT T T C CAT C CT CGATACAAGGAT TATC TAT TAT TTGAT A 

SEQ2 807 TTGCAGCATATAAAGCTATTGATTTCCIATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 808 TTGCAGCATATAAAGCTATTGATTTCC!ATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 809 TTGCAGCATATAAAGCTATTGATTTCCATCCT CGATACAAGGATTATCTAT TATTTGATA 

SEQ2 810 TTGCAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 811 TTGCIAGCATATAAAGCTATTGATTTCCATCCTCGATACAAGGATTATCTATTATTTGATA 

SEQ2 801 AAGAGAATAT CAGT AAAGAAGATAGACAAAAGAT T -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 802 AAAAACATTTGATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAAT 

SEQ2 803 AAGAGAATAT CAGT AAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 804 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2805 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 806 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2807 AAGAGAATAT CAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2808 AAGAGAATAT CAGTAAAGAAGATAGACAAAAGATT - AAAGAACTTTCTTTGTCACAGGGA 

SEQ2809 AAGAGAATATCAGTAAAGAAGATAGACAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 810 AAGAGAATATCAGTAAAGAAGATAGAaAAAAGATT -AAAGAACTTTCTTTGTCACAGGGA 

SEQ2811 AAGAGAATATCAGTAAAGAAGATAGAC?^UU\GATT-AAAGAACTTTCTTTGTCACAGGGA 

SEQ2 801 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 802 TCCGTGGCAGTTGTTGAATTTTTCTGATCCA TCATCTCAAAAAATTCACGATGATTA 

SEQ2 803 TACGTTA -AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 8 0 4 TACGTTA -AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 805 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 806 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 807 TACGTTA -AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ280 8 TACGTTA - AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 809 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2 81 0 TACGTTA- AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 

SEQ2811 TACGTTA -AACTGCTAAATGCTTATCACAAAATCCCTGTTCTAGTCACGGGTTATGGCTA 
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SEQ2801 
SEQ2802 
SEQ2 803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2 80 9 
SEQ2 810 
SEQ2811 

SEQ2 801 
SEQ2802 
SEQ2803 
SEQ2 804 
SEQ2 805 
SEQ2 80 6 
SEQ2807 
SEQ2 808 
SEQ2809 
SEQ2810 
SEQ2 811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2 804 
SEQ2 805 
SEQ2 80 6 
SEQ2 807 
SEQ2 80 8 
SEQ280 9 
SEQ2810 
SEQ2811 

SEQ280X 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ28X1 

SEQ2801 
SEQ2802 
SEQ2 803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2 809 
SEQ2810 
SEQ2811 

SEQ280X 
SEQ2 802 
SEQ2803 
SEQ2 804 
SEQ2 805 
SEQ2806 
SEQ2 807 
SEQ2 808 



TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TTTAAACATTATGGTGTGAAGGAGTTAGAAATTGA - GAGCATTGCTTTAGGATTAGGTG 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 
TCGACAGCGAGAGGTATTGCCCAAAAAGAAATTGATAAACGTCCTCTGCCGATTAATGA 

AAAGAA CAAGGT CAG C GT T TAC TAGAAGAT T ATGAAT C T T T TATAT CAT CCGGTAGTT T 

TAATAG CAAAGAAAACACAC TG ATAAAGATGG CAGAT TATCGTTTGAAAAATT 

AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGT CAG C GT T TAC TAGAAGAT T ATGAAT C TT T TATAT CAT CCGGTAGTT T 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAA(2AAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 
AAAGAACAAGGTCAGCGTTTACTAGAAGATTATGAATCTTTTATATCATCCGGTAGTTT 

GGAGCGACTATCAATGC^TGGCIAAGACGATTGGAATGCAAGGGCGTGGAATACATCCTT 

GGAGAGAC - - CCGATAC CAAAACCTTTTTAA AAGACTCCTATTATAGTATT 

GGAGCGACTAT CAATGCATGGCAAGACGAT TGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTAT CAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
G^AGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGTGTGGAATACATCCTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGTGTGGAATACATCCTT 
GGAGCGACTAT CAATGCATGGCAAGACGAT TGGAATGCAAGGGCGTGGAATACATCTTT 
GGAGCGACTATCAATGCATGGCAAGACGATTGGAATGCAAGGGCGTGGAATACATCTTT 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGC^^ 

A- -AGAAAGAA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATa^AGGTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATC^ 

GCCAGAAATAAACATAGTCAATTCCTATGGGGGGATGCACAA 

GCCACAAATAAACATAATCAATTCCTATGGG<3GGATGCACAAGTATTTAATC^ 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCA 

GC(^CAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTA 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGAT^ 

GCCACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTA 

GC(ZACAAATAAACATAGTCAATTCCTATGGGGGGATGCACAAGTATTTAATCAAGGTTA 

GGT T TAT TAGG C T T TAAAAAC G CAAAA CAT CAT TAT CAAGT TGATGGT AAAAGAGG CAA 

GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGC^A 
GGTTTATTAGGCTTTAAAAACGCAAAAC^TCATTATCMGTTGATGGTAAAAGAGGCZAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAAGTTGATGGTAAAAGAGGCAA 
GGT TTATTAGGCTTTAAAAACGCAAAACAT CATTATCAAGTTGATGGTAAAAGAGG CAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAGGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACGCAAAACATCATTATCAGGTTGATGGTAAAAGAGGCAA 
GGTTTATTAGGCTTTAAAAACG(^y\AC^TC^TTAT(^UVGTTGATGGTAAAAGAGGC^ 
G<3TTTATTAGGCTTTAAAAACGC!AAAA(^TCATTATCA^ 



GGAGAGTGGAAACATCCTCTG • 



GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCT CTGATGACTAGTGCAACAGGAGATGACT TATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
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SEQ2809 
SEQ2 810 
SEQ2 811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2 80 9 
SEQ2810 
SEQ2 811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ280 6 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ280X 
SEQ2 802 
SEQ2 803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2 808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2 803 
SEQ2 804 
SEQ2805 
SEQ2 80 6 
SEQ2807 
SEQ2 80 8 
SEQ2 809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2 804 
SEQ2 805 
SEQ2 806 
SEQ2807 
SEQ2808 
SEQ280 9 
SEQ2810 
SEQ2 811 

SEQ2 801 
SEQ2 802 
SEQ2803 
SEQ2804 



Table 28: Comparative Sequences relating to SAG1552 

(conserved hypothetical protein) 

GAAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 
GGAGAGTGGAAACATCCTCTGATGACTAGTGCAACAGGAGATGACTTATATGCTAGCAG 



GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 
GATGAAAGCTATCTCTACCTTGCGATTAAAACAAAACCTGAAAAACTAAAAGAAAAACG 



TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTAC1ACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTAC^CCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 
TTATTACCAATAGATATTACACCAAAATCTGGTAGTAGAAAAATGAATGGTAGTAAGGT 



ACATTTTCTAAAT CTAGTGACTTTGTATTGTCTATTGATC CAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 
ACATTTTCTAAATCTAGTGACTTTGTATTGTCTATTGATCCAAATGGCAAGTCTGAATT 



TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAA 
TTTGTCCAAGAGCGCTATAACGCCTTAAAAGCGAACTATCTTCGACAGCTTAATGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 
TTTGTCCAAGAGCGCTATAATGCCTTAAAAGCGAACTATCTTCGACAGCTTAACGGTAA 



GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GAT T TT TATG C TT T C CCAC CAAAG AAG AACAGT AGTAAT T T T GAGCAGATAAAT AT GGT 
GATTTTTATGCTTTGCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 
GATTTTTATGCTTTCCCACCAAAGAAGAACAGTAGTAATTTTGAGCAGATAAATATGGT 



TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
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SEQ2 805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2 80 9 
SEQ2810 
SEQ2811 

SEQ2 801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2804 
SEQ2805 
SEQ280 6 
SEQ2807 
SEQ280 8 
SEQ2809 
SEQ2810 
SEQ2811 

SEQ2801 
SEQ2802 
SEQ2803 
SEQ2 804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2808 
SEQ280 9 
SEQ2810 
SEQ2 811 

SEQ2 801 
SEQ2 802 
SEQ2 803 
SEQ2 804 
SEQ2805 
SEQ2806 
SEQ2807 
SEQ2 80 8 
SEQ2 809 
SEQ2 810 
SEQ2 811 



TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGAT TGTTGAAGACATGGAAAAAGTAAAAG CAA CAGAGAGGT T CT T 
T TGAGAAATACAAAGAT TGT TGAAGACAT GGAAAAAGTAAAAG CAACA GAGAGGT T C T T 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 
TTGAGAAATACAAAGATTGTTGAAGACATGGAAAAAGTAAAAGCAACAGAGAGGTTCTT 



CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCA 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAA 

CCAACT(^TCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACC^AAAACATT 

CCAACTCATCCTACTGGTCTTCTCAAAAC^GGAACAACTGAT^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGA^ 

CC!AACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATA 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAG^ 

CCAACTCATCCTACTGGTCTTCTC^ 

CCAACTCATCCTACTGGTCTTCTCAAAACAGGAACAACTGATAGGCACCAAAAAA 



GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTC^GAATTCCGTGGCA 
GATTCACAACCAGATATTTCGTTTGGAAAGGACTTTATAGAG<3TCAGAATTCCGTGGC^ 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCACMA(^GATATTTCGTTTGaAAAGGACTTTATAGAGGTaA.GAATTCCGTGGCA 
GATTCACAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 
GATTCA.CAAACAGATATTTCGTTTGGAAAGGACTTTATAGAGGTCAGAATTCCGTGGCA 



TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAGAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTGAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 
TTGTTGAATTTTTCTGATCCATCATCTCAAAAAATTCACGATGATTACTTTAAACATTA 



GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGT TAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAAATTGAGAGCCATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG- - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 
GGTGTGAAGGAGTTAGAAATTGAGAG - - CATTGCTTTAGGATTAGGTGCTAATAGCAAA 



AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
AAAACACACTGATAAAGATGGCAGATTATCGTTTGAAAAATTGGGAGAGACCCGATACC 
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SEQ2801 
SEQ2802 
SEQ2303 
SEQ2804 
SEQ2805 
SEQ2806 
SEQ2 807 
SEQ2808 
SEQ2809 
SEQ2 810 
SEQ2811 



AAACCTTTTTAAAAGACTCCTATTATGTATTAAGAAAGAA 

AAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGGTCTAAAGAAAGAGAG 

AAACCTTTTTAAAAGA 

AAAC CTTTTTAAAAGACTC CTATTATGTATTAAGAAAGA 

AAACCTTTTTAAAAGACT ~~ 

AAACCTTTTTAAAAGACTCCTATTATAGT • 

AAACCT TT T T AAAAGA CT C CT ATT ATAGTAT TAAGAAAG 

AAACCTTTTTAAAAGACTCCTATTATAGTATTAAG 

AAACCTTTTTAAAAGACTCCTATTATAGTATTAAGAAAGAATGG 



SEQ2801 

SEQ2802 

SEQ2803 

SEQ2 804 GAACATATGGTCCA 

SEQ2805 

SEQ2806 

SEQ2807 

SEQ2808 

SEQ2809 

SEQ2810 

SEQ2811 



>SEQ ID NO 2850:62_1169NT frame: 1 

FWKGDTVLHKPTNKPFVVKGVDVESSLAGYH^^ 

KVPMNVAF YDAL YHHNKASKRPL YLLQGI R I DS YRNNAS I TAFNDN YRGYL KREAKGWD 
ILHGRKQVWNTDFGSRHYHTOLSP^WLGYWGDDWNSGTVAYTOHQEKKTQYKGRYFKTS 
AAAN P F EVMIAQVMDE LTH YE T AKYGVJ QHLi I S F SN S PT TD PFR YRKP FEAQ APKYVQLNV 
ENI QANSNVKAGI FAA YKAIDFHPRYKD YLLFDKENI S KEDRQKI KELSL S QG YVKLLNA 
YHKI PVLVTGYGYS TARGI AQKE I DKRPLP INEKEQGQRLLED YES F I S SGS FGAT INAW 
QDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFKNAKHHYQVDGK^GKGEWKHPL 
MTSATGDDLYASSDESYLYIiAIKTKPEKLKEKRLLPIDITPKSGSRKMNGSKVTFSKSSD 
FVLSIDPNGKSELFVQERYNAIiKANYLRQLNGKDFYAFPPKKNSSNFEQINIWLRNTKIV 
EDMEKVKATERFLPTHPTGLLKTGTIDRHQKTFDSQTDISFGKDFIEVRIPWQLLNFSDP 
S SQKI HDD YFKHYGVKELEI E S I ALGLGANSKENTLI KMADYRLKNWERPDTKT FLKDS Y 
YSI.ER 

>SEQ ID NO 2851: 62_18RS21 frame: 1 

KGLLKJSNTRTNFVVKGDTVLHKPTNKPFV^ 

I SNMGANTVRVKVPMNVAF YDAL YHHNKAS KRPLYLLQGIRIDS YRNNAS I TAFNDNYRG 
YLK^EAKGVVDILHGRKQVWNTDLGSRHYHYDLSP^WLGYWGDDWNSGTVAYTNHQEKK 

TQYKGRYFKTS VAANPFEV>niAQVMDELTHYETAKYGWQHLI SFSNS P TTDPFHYRKP FE 
AQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLIiFDKENI S KEDRQKI KELS 
LSQGYVKLLNAYHKIPVIiVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFIS 
SGS FGATINAVJQDDVJNARAWNT S FATNKHS QFXiWGDAQVFNQGYGIjLGFKNAKHHYQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYLYIiAIKTKPEKLKEKRLLPIDITPKSGSRKMN 
GSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQ 
I^WTjRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDISFGKDFIEVR 
IPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKRADYRLKNWER 

PDTKTFLKDSYYVLRK 

>SEQ ID NO 2852:62_2603 frame: 3 

LKENTRTNFWKGDTVIjHKPTNKPFVVKGVDVESSIjAGYHHNDFP I TQKTYREWFHLISN 
MGANTVRVKVPMNVAFYDALYHHNKASKRPLYIjLQG 

re^kgvvdilhgrkqvwntdlgsrhyhydlspwvlgywgddwnsgtvaytnhqekktqy 

kgryfktsvaanpfevmlaqvmdelth yetakygwqhli s fsnspttdpfhyrkpfeaqa 
pkyvqlnveniqansnvkagmfaaykaidfhprykdyllfdkeniskedrqkikelslsq 
gyvkllnayhkipvlvtgygystargiaqkeidkrplpinekeqgqrlledyesfissgs 
fgatinawqdd^aratotsfatnkhsqflwgdaqvfnqgygllgfknakhhyqvdgkrg 
kgewkhplmtsatgddlyassdesyiiylaiktkpeklkekriilpiditpksgsrkmngsk 

VTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQINM 
VLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI S FGKDFI EVRI PW 
QLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKMADYRLKNWERPDT 
KTFLKDS YYS IKKEWS KERERT YGP 
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>SEQ ID NO 2853:62_A909 frame: 1 

KGLLKENTRTNFWKGDTVLHKPTNKPFVVKGVDVESSIiAGYHHNDFPITQKTYREWFHL 
ISNMGAimmVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRiJNASITAFNDNYRG 
YLKREAKGWD I LHGRKQVWNTDLGSRHYH YDL S PWVLG YVVGDDWNSGTVAYTNHQEKK 
TQ YKGR YFKTS VAANP FEVMLAQVMDELTHYETAKYGWQHL I S FSNS PTTDPFHYRKP FE 
AQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISKEDRQKIKELS 
LSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYES FI S 
SGS FGAT INAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQG YGLLGF KNAKHH YQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYIiYLAIKTKPEKLKEKRLIiPIDITPKSGSRKMN 
GSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQ 
INMVLRNTKI VEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDS QTD I S FGKDF I EVR 
I PWQLLNFSDP S S QRI HDD YFKH YGVKELEN . EPLL . D . VLIAKKTH . . RWQIIV . KIGR 
DPIPKPF.K 

>SEQ ID NO 2854:62_A909 frame: 1 

KGLLKENTRTNFVVKGDTVLHKPTNKP FWKGVDVES S LAGYHHND FP I TQKT YRE WFHL 
ISNMGANTVRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNASITAFNDNYRG 
YLKREAKGWD I LHGRKQVWNTDLGS RH YH YDLS PWVLGYWGDDWNS GTVAYTNHQEKK 
TQ YKGR Y F KT S VAANP F EVMLAQ VMDE LTH YE TAK YGW QHL I S FSNS PTTDP FH YRKPF E 
AQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLiLFDKENI SKEDRQKI KELS 
LSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFIS 
SGS FGATI NAWQDDWNARAWNTS FATNKHS QFLWGDAQWNQGYGLLGFKNAKHHYQVDG 
KRGKGEWKHPLMTSATGDDLYASSDESYLYIAIKTKPEKLKSKRLLPIDITPKSGSRKMN 
GSKVTF SKS SDFVL S IDPNGKS EL FVQER YNAiKANYLRQLNGKDF YAFPPKKNS SNFEQ 
INMVLRNTKI VEDMEKVKATERFL PTHPTGLLKTGTTDRHQKT FDS QTD I S FGKDF I EVR 
I PWQLLNFSD P S SQRI HDD YFKHYGVKELEN . EPLL . D . VLIAKKTH . . RWQI IV . KIGR 
DPIPKPF . K 

>SEQ ID NO 2855 :62_CJB 110 frame: 1 

YYFDGS LYLP KGLLKENTRTNFVVKGDTVLHKPTNKP FVVKGVDVE S S LAG YHHNDFP I T 
QKTYREWFHLIS^GANTVRVKVPNINVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 
ITAFNDNYRGYLKREAKGWD I LHGRKQVWNTDFGSRH YH YDL S PWVLGYWGDDWNS GT 
VAYTNHQEKKTQYKGRYFKTSVAANPFF^LAQVMDELTHYETAKYGWQHLISFSNSPTT 
DPFHYRKP FEAQAPKYVQLNVENI OANSNVKAGMFAAYKA I DFHPR YKD YLL FDKENI S K 
EDRQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR 
LLED YE S F I S S GS FGAT INAWQDDWNARAWNTS FATNKHNQFLWGDAQVFNQGYGLLGFK 
NAKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI 
TPKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
SFGKDFI EVRI PWQLLNFSDPS SQKIHDDYFKHYGVKELE I ES I ALGLGANSKENTLI KM 
ADYRLKNWERPDTKTFLKDSYYVLRK 

>SEQ ID NO 2856:62_COHl frame: 2 

LPQGLL KENTRTNFWKGDTVLHKPTNKPF WKGVDVE S SLAG YHHND FP I TQKT YREWF 
HLI SNMGANTVRVKVPMNVAF YDAL YHHNKES KRPLYLLQGIRIDS YRNNAS I TAFNDNY 
RGYLKREAKGVVDI LHGRKQVWNTDFGSRH YH YDL S PWVLGYWGDDWNS GTVA YTNHQE 
KKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTTDPFHYRKP 
FEAQAP KYVQLNVENI QANSNVKAGMFAAYKAIDFHPR YKDYLLFDKENI S KEDRQKI KE 
LSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESF 
IS S GS F GAT INAWQDDWNARAWNT S FATNKHS QFLWGDAQVFNQG YGLLGFKNAKHHYQV 
DGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITPKSGSRK 
MNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNF 
EQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQPDISFGKDFIE 
VRI PWQLLNFSDPS SQKIHDDYFKHYGVKELE I ES IALGLGANSKENTLI KMADYRLKNW 
ERPDTKTFLKD 

>SEQ ID NO 2857:62_H36B frame: 2 

RGLLKENTRTNFWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPITQKTYREWFHL 
I SNMGANTVRVKVPMNVAF YDAL YHHNKAS KR PL YLLQGI RID S YRNNAS I TAFNDNYRG 
YLKREAKGWD I LHGRKQVWNTDFGS S HYH YDLS P WVLGYWGDDGHS GTVALY 

>SEQ ID NO 2858:62_JM9130013 frame: 3 

FWKGDTVLHKPTNKP FWKGVDVES SLAG YHHNDFP I TQKT YREWFHL I SNMGANTVRV 
KVPMNVAF YDAL YHHNKAS KRPLYLLQGI RIDS YRNNAS I TAFNDNYRGYLKREAKGWD 
ILHGRKQVWNTDFGS S HYH YDLS PWVLG YWGDDWNS GTVAYTNHQEKKTQ YKGR YFKT S 
VAANPFEVMLAQVMDELTHYETAKYGWQHLI S FSNSPTTDPFH YRKPFEAQAPKYVQLNV 
ENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENIS KEDRQKI KELSLSQGYVKLLNA 
YHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLLEDYESFISSGSFGATINAW 
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QDDWNAR WNT S FATNKHS QFLWGDAQVFNQ GYGLL GF KNAKHH YQ VDGKRGKE E W KH P L 
MTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITPKSGSRKMNGSKVTFSKSSD 
FVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPKKNSSNFEQINMVLRNTKIV 
EDMEKVKATERFLPTHPTGIiLKTGTTDRHQKTFDSQTDISFGKDFIEVRIPWQLLNFSDP 
SSQKIHDD YFKHYGVKELE IES IALGLGANSKENTLI KMADYRLKNWERPDTKTFLKDSY 
YSIKK 

>SEQ ID NO 2859:62_M732 frame: 2 

TRTNFWKGDTVLHKP TNKP FWKGVD VES S LAGYHHNDF PI TQKTYREWFHLI SNMGAN 
TVRVKVPMNVAF YDALYHHNKESKRPLYLLQGI RIDS YRNNAS I TAFND 
GWD I LHGRKQVWNTDFGS RHYHYDL S P WVLGYVVGDDCNSGTVAYTNHQEKKTQ YKGR Y 
FKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTTDPFHY^ 
QLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISKEDRQKIKELSLSQGYVK 
LLNAYHKI PVLVTGYGYSTARGIAQKEIDKJRPLPINEKEQGQRLLEDYESFISSGS FGAT 
INAWQDDWNARAWNTS FATNKHS Q FLWGDAQVFNQGYGLLGFKNAKHH YQVDGKRGKGEW 
KHPLMTSATGDDLYASSDESYIiYIiAIKTKPEKIjKEKRLLPIDITPKSGSRKMNGSKVTFS 
KS SDFVLS IDPNGKS EL FVQER Y1JALKANYLRQLNGKDF YAFP PKKNS SNFEQ I NMVLRN 
TKI VEDMEKVKATERFLPTHPTGIiXiKTGTTDRHQKTFDSQTDI SFGKDFI EVRI PWQLLN 
FSDPSSQKIHDDYFKHYGVKELEI ES IALGLGANSKENTLI KMADYRLKNWERPDTKTFL 
KDSYYSIK 

t 

>SEQ ID NO 2860:62_M781 frame: 1 

FDGSLYLPQGLLKENTRTNFWKGDTVLHKPTNKPFVVKGVDVESSIiAGYHHNDFPITQK 
TYREWFHLISNMGANTVRVKVPMNVAF 

AFNDNYRG YLKREAKGVVD I LHGRKQVWNTDFGS RHYHYDL S P WVLGYWGDDWNS GTVA 
YTNHQEKKTQYKGRYFKTS VAANP FEVMLAQVMDELTHYETAKYGWQHL I S FSNS PTTD P 
FHYRKP FEAQAPKYVQLNVENI QANSNVKAGMFAAYKAIDFHPRYKD YLL FDKENI SKED 
RQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQRLL 
EDYE S F I S S GS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFKNA 
KEHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDITP 
KSGSR^INGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFPPK 
KNS SNFEQ INMVLRNTKI VEDMEKVKATERFLPTHPTGLLKTGTTDRHQKT FDS QTD I S F 
GKD F I EVRI P WQLLNF SD PS SQKI HDD YFKHYGVKELE IES I ALGLGANS KENTLI KMAD 
YRL KNWERPDTKTFLKDS Y YS I KKEW 



SEQ2850 FWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPIT 

SEQ2851 KGLL KENTRTNFWKGD TVLHKP TNKP FWKGVD VES S LAG YHHNDF PIT 

SEQ2852 LKENTRTNFVWGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPIT 

SEQ2 853 KGLLKENTRTNFVVKGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPIT 

SEQ2 854 KGLL KENTRTN FWKGD TVLH KP TNKP F WKGVD VE S S LAG YHHND F P I T 

SEQ2855 YFDGSLYLPKGLLKENTRTNFWKGDTVLHKPTNKPFWKGVDVESSLAGYHHNDFPIT 

SEQ2 856 L PQGLLKENTRTNFWKGDTVLHKPTNKP FWKGVD VES SLAG YHHNDF P I T 

SEQ2 857 RGLLKENTRTNFVVKGDTVLHKPTNKP FWKGVDVES SLAGYHHNDF PI T 

SEQ2858 FVVKGDTVLHKPTNKPFVVKGVDVESSLAGYHHNDFPIT 

SEQ2 859 TRTNFWKGDTVLHKPTNKP FWKGVDVES S LAG YHHNDFP I T 

SEQ2 860 - FDGSL YliPQGLLKENTRTNFVVKGDTVLHKP TNKP FWKGVDVES S LAG YHHNDFP I T 

SEQ2850 QKTYREWFHLISNMGANTVRVKVPM1WAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2851 QKTYREWFHLIS^GANTVRVKVPMNVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2852 QKTYREWFHLISNMGANTVRVKVPNTbTVAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2 853 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2854 QKTYREWFHLIS^IGANTVRVKVPMWAFYDALYHHNKASKRPLYLLQGIRIDSYRNNAS 

SEQ2855 QKTYREWFHLI SNMGANTTOVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2 856 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKESKRPLYLLQGIRIDS YRNNAS 

SEQ2857 QKTYREWFHLI SNMGANTVRVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2 858 QKTYREWFHL I SNMGANTWVKVPMNVAF YDALYHHNKASKRPLYLLQGIRIDS YRNNAS 

SEQ2859 QKTYREWFHLI SNMGANTVRVKVPMNVAFYDALYHHNKESKRPLYLLQGIRIDS YRNNAS 

SEQ2860 QKTYREWFHLISNMGANTVRVKVPMNVAFYDALYHHNKESKRPLYLLQGIRIDS YRNNAS 

SEQ2 850 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDFGS RH YH YDLS PWVLG YWGDDWNSGT 

SEQ2 851 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDLGS RH YHYDLS PWVLGYWGDDWNSGT 

SEQ2 852 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDLGS RH YHYDLS PWVLGYWGDDWNSGT 

SEQ2 853 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDLGS RHYHYDL S PWVLGYWGDDWNSGT 

SEQ2 854 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDLGS RH YHYDLS PWVLGYWGDDWNSGT 

SEQ2 855 I TAFNDNYRGYLKREAKGWDI LHGRKQVWNTDFGS RHYHYDL S PWVLGYWGDDWNSGT 

SEQ2 856 I TAFNDNYRG YLKREAKGWDI LHGRKQVWNTDFGSRH YH YDL S PWVLGYWGDDWNSGT 

SEQ2 857 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDFGS SH YH YDL SPWVLGYWGDDGHSGT 

SEQ2858 I TAFNDNYRGYLKREAKGWD I LHGRKQVWNTDFGS SH YHYDLS PWVLGYWGDDWNSGT 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 



SEQ2859 
SEQ2 86 0 
SEQ2 850 
SEQ2851 
SEQ2852 
SEQ2 853 
SEQ2854 
SEQ2855 
SEQ2 856 
SEQ2857 
SEQ2858 
SEQ2859 
SEQ2860 

SEQ2850 
SEQ2 851 
SEQ2852 
SEQ2853 
SEQ2854 
SEQ2855 
SEQ2856 
SEQ2857 
SEQ2858 
SEQ2859 
SEQ2860 

SEQ2850 
SEQ2 851 
SEQ2852 
SEQ2 853 
SEQ2854 
SEQ2855 
SEQ2856 
SEQ2857 
SEQ2858 
SEQ2859 
SEQ2860 

SEQ2850 
SEQ2851 
SEQ2 852 
SEQ2853 
SEQ2 854 
SEQ2 855 
SEQ2 856 
SEQ2857 
SEQ285 8 
SEQ2 85 9 
SEQ2860 

SEQ2850 
SEQ2851 
SEQ2852 
SEQ2853 
SEQ2 854 
SEQ2855 
SEQ2856 
SEQ2857 
SEQ2858 
SEQ2859 
SEQ2860 

SEQ2850 
SEQ2851 
SEQ2 852 
SEQ2853 
SEQ2854 
SEQ2855 



ITAFND^RGYLKREAKGVVDILHGRKQVWNTDFGSRHYHYDLSPWVLGYVVGDDCNSGT 
I TAFNDNYRGYLKRFAKGVVDILHGRKQVWNTDFGSRHYHYDLS PWVLGYVVGDDWNS GT 
VAYTNHQEKKTQ YKGR YFKTSAAANPF EIVMLAQVMDELTH YETAKYGWQHL I S F SNS PTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHL ISFSNSPTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMIiAQVMDELTHYETAKYGWQHLISFSNSPTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHLISFSNSPTT 
VAYTiraQEKKTQYKGRYFKTSVAANPFEVMIiAQVMDELTHYETAKYGWQHIilSFSNSPTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHL I SFSNS PTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHL ISFSNSPTT 

VALY 

VAYTNHQEKKTQYKGRYFKTSVAANPFEVMIiAQVMDELTHYETAKYGWQHLISFSNSPTT 
VAYTNHQE KKTQ YKGR YFKTS VAANP F EVMIiAQVMDELTH YETAKYGWQHL I S F SNS PTT 
VAYTNHQEKKTQYKGRYFKTSVAANPFEVMLAQVMDELTHYETAKYGWQHL ISFSNSPTT' 

PFRYRKPFEAQAPKWQLNVENIQANSNVKAGIFAAYKAIDFHPRYKDYLLFDKENISK 
PFHYRKPFEAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK 
PFHYRKPFEAQAPKYVQIiNVENIQANSNVKAGMFAAYKAIDFHPRYKD YLL FDKENISK 
P FH YRKP FEAQ APKWQLNVENI QANSNVKAGMFAAYKAI DFHPRYKD YLL FDKENI SK 
PFHYRKPFFAQAPKYVQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK 
P FH YRKP FEAQAPKYVQLNVEN I QANSNVKAGMFAAYKAI DFHPR YKD YLL FDKENI S K 
P FH YRKPF FAQAPKYVQLNVENI QANSNVKAGMFAAYKAI DFHPR YKD YLL FDKENI S K 

PFHYRKPFEAQAPKYVQLNVENI QANSNVKAGMFAAYKAIDFHPRYKDYLIt FDKENI SK 
PFHYRKPFEAQAPKWQLNVENIQANSNVKAGMFAAYKAIDFHPRYKDYLLFDKENISK 
PFHYRKPFEAQAPKWQLNVENIQANSNVKAGMFAAYKAIDFHPRYKD YLL FDKENI SK 

DRQKI KELS L S QGYVKLLNAYHKI PVLVTG YGYS TARGI AQKE I DKRPLP I NEKEQGQR 
DRQKI KELS LS QGYVKLLNAYHKI PVLVTG YGYS TARGI AQKE IDKRPLPINEKEQGQR 
DRQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPI NEKEQGQR 
DRQKIKELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR 
DRQKI KEL SLS QGYVKLLNAYHKI PVLVTG YGYS TARGI AQKE I DKRPLP I NEKEQGQR 
DRQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR 
DRQKI KELSLSQGYVKLLNAYHKIPVLVTGYGYSTARGIAQKEIDKRPLPINEKEQGQR 

DRQKI KELSLSQGYVKLLNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPI NEKEQGQR 
DRQKI KELSLSQGYVKLIjNAYHKI PVLVTGYGYSTARGIAQKEIDKRPLPI NEKEQGQR 
DRQKIKELSLSQGYVKLLNAYHKI PVLVTG YGYS TARGI AQKE IDKRPLPINEKEQGQR 

LEDYESFISSGSFGATINAWQDDWNARAWNTSFATNKHSQFLWGDAQVFNQGYGLLGFK 
LED YES F I S SGS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFK 
LED YES F I S S GS FGATI NAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQ GYGLLGFK 
LED YES F I S S GS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLL GFK 
LED YE S F I SS GS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFK 
LED YE S F I S S GS FGATINAWQDDWNARAWNTS FATNKHNQFLWGDAQVFNQGYGLLGFK 
LEDYES FISSGS FGATINAWQDDWNARAWNTS FATNKHSQFLWGDAQVFNQ GYGLLGFK 

LED YES F I S SGS FGATINAWQDDWNARVWNTS FATNKHSQFLWGDAQVFNQGYGLLGFK 
LED YES F I S S GS FGATINAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFK 
LEDYESFISS GS FGAT INAWQDDWNARAWNTS FATNKHS QFLWGDAQVFNQGYGLLGFK 

AKHHYQVDGKRGKGEWKHPLMTS ATGDDL YAS SDES YL YLAI KTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYL'YLAIKTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDL YAS SDES YL YLAI KTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDLYAS SDESYLYLAI KTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDL YASSDESYLYLAIKTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDL YASSDESYL YLAI KTKPEKLKEKRLLP IDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDL YAS SDESYLYLAI KTKPEKLKEKRLLPIDI 

AKHHYQVDGKRGKEEWKHPLMTS ATGDDL YAS SDES YL YLAI KTKPEKLKEKRLLP IDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDLYAS SDESYLYLAI KTKPEKLKEKRLLPIDI 
AKHHYQVDGKRGKGEWKHPLMTSATGDDLYASSDESYLYLAIKTKPEKLKEKRLLPIDI 

PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKS GS RKMNGS KVTFS KS SDFVLS I DPNGKS EL FVQER YNALKAN YLRQLNGKD F YAFP 
PKS GSRKMNGS KVTFS KS S DFVLS I DPNGKS ELFVQER YNALKANYLRQLNGKDF YAFP 
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Table 28: Comparative Sequences relating to SAG1552 
(conserved hypothetical protein) 



SEQ2 856 
SEQ2857 
SEQ285 8 
SEQ2 85 9 
SEQ28 60 

SEQ2 850 
SEQ2 851 
SEQ2 852 
SEQ2853 
SEQ2 854 
SEQ2855 
SEQ285 6 
SEQ2857 
SEQ285 8 
SEQ2859 
SEQ2860 

SEQ2850 
SEQ2 851 
SEQ2852 
SEQ2 853 
SEQ2 854 
SEQ2855 
SEQ2856 
SEQ2857 
SEQ2 85 8 
SEQ2 85 9 
SEQ2860 

SEQ2850 
SEQ2851 
SEQ2852 
SEQ2853 
SEQ2854 
SEQ2855 
SEQ2856 
SEQ2857 
SEQ285 8 
SEQ285 9 
SEQ2860 



PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSEIiFVQERYNAIjKANYLRQLNGKDFYAFP 

PKSGSRKMNGSK^TFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 
PKSGSRKMNGSKVTFSKSSDFVLSIDPNGKSELFVQERYNALKANYLRQLNGKDFYAFP 

KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTIDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVFJDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQPDI 

KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 
KKNSSNFEQINMVLRNTKIVEDMEKVKATERFLPTHPTGLLKTGTTDRHQKTFDSQTDI 

FGKDFIEVRIPWQLLNFSDPSSQKIHDDYFKHYGVKELEIESIALGLGANSKENTLIKM 
FGKDFI EVRI PWQLLNFSDPS S QKIHDD YFKHYGVKELEI ES I ALGLGANS KEN TL I KM 
FGKDF I EVRI PWQLLNFSDPS S QKIHDD YFKEIYGVKELE I ES I ALGLGANS KENTL I KM 
FGKDF I EVRI PWQLLNFSDPSSQRIHDDYFKHYGVKELENEPLLDVL I AKKTHRWQ I IV 
FGKDFI EVRI PWQLLNFSDPS SQRIHDDYFKHYGVKELENEPLLDVL I AKKTHRWQ I IV 
FGKDF I EVRI PWQLLNFSDPS S QKIHDD YFKHYGVKELE I ES I ALGLGANSKENTL I KM 
FGKDF I EVRI PWQLLNFSDPS S QKI HDD YFKHYGVKELE I ES I ALGLGANS KENTL I KM 

FGKDF I EVRI PWQLLNFSDP S S QKI HDD YFKHYGVKELE I ES I ALGLGANS KENTL I KM 
FGKDF I EVRI P WQLLNFSDP S S QKI HDD YFKHYGVKELE I ES I ALGLGANS KENTL I KM 
FGKDF I EVRI PWQLLNFSDP S S QKIHDD YFKHYGVKELE I ES I ALGLGANS KENTL I KM 

D YRLKNWERPDTKT FL KDS YYS IER 

D YRLKNWERPDTKT FL KDS YYVLRK 

D YRLKNWERPDTKT FL KDS YYS IKKEWSKERERTYGP 

IGRDPIPKPFK 

IGRDPI PKPFK : 

DYRLKNWERPDTKTFLKDS YYVLRK 

DYRLKNWERPDTKTFLKD 

DYRLKNWERPDTKTFLKDS YYS IKK 

DYRLKNWERPDTKTFLKDS YYS IK 

DYRLKNWERPDTKTFLKDS YYS IKKEW 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 

SEQ ID NO. 2901: SAG1641 FROM THE 090 GBS TYPE la STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCTAAAATCAAATTCACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGAC AGC C AATAAGGATGTGGATATTAATGC CTTTC AACATTACAATTTC TTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAA 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CAC ACAGATGAAGTGAAAAAAGTTATC AAAGATACTTC AGCTGATATTC C AC AATGGAAC C CAGCTTTC TTGTAC AA 

SEQ ID NO. 2902: SAG1641 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAG 
CACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAAC 
CAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGG 
AAAATAAGAAAAAC TTAATTC C AC TTGAAAAGACTTACTTAGCTC CAATTCGTATCTATT CTGAGAAGGTAAAATCTC 
TTAAAAAATTGAAAAAAGGAGC CACTATTGC AATTC CAAATGATGCAAC AAATGGTAGCCGTGCATTGTATGT C CTTC 
AGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGG 
ATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATA 
CATAC ATTGAGCAAGCTAATTTAAAAC CTTCAGATGCTATC TTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGA 
TTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATC 
AC ACAGATGAAGTGAAAAAAGTTATC AAAGAT ACTTCAGCTGATATT C CACAATGG 

SEQ ID NO. 2903: SAG1641 FROM THE 18RS21 GBS TYPE II STRAIN 

AATCAAGAAGT TT CAGCAAGCTC AACTTC AAGTAAAGTTGTTAAAGTTGGTGT TATGAC CTTTTCTGACAC TG AAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA 
C CAAATCAAGCGAC AGC CAATAAGGATGTGGATATTAATGCCTTT CAACATTACAATTTC TTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGAT ACTTCAGCTGATATT C CAC 

SEQ ID NO. 2904: SAG1641 FROM THE 2603 V/R GBS TYPE V STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAG CTAGTAGGTGATAAAGCTAAAATC AAATT TACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGC CAC TATTGCAATT CCAAATGATGC AACAAATGGTAGC CGTGCATTGTATGTC CTT 
C AGT C AGC AG GT T T AAT C AAAT TG AATGT TT C TGGT AAGAAGGT TG C AACAGT TG C T AAT AT CAC AT C T AAT AAAAAG 
GATATTAATATTCAGGAGTTAGATGCCAGTCAAACACCACGTGCAPTCAAAGATGTAGATGCAGCTATTATTAATAAT 
AC AT AC ATTGAGCAAGCTAATTTAAAAC CTT CAGATGC TATCTT TGTTGAGAAAT C AGATAAAAATT C AAAAC AATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACAC AGATGAAGTGAAAAAAGTT AT CAAAGATACT TC AGC TGATATT C CACAATGG 

SEQ ID NO. 2905: SAG1641 FROM THE A909 GBS TYPE la STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA 
C CAAATCAAGCGAC AGC CAATAAGGATGTGGAT ATT AATGC CTTT CAACATTACAAT TTC TTAGAAAAC TGG AAT AAG 
GAAAATAAGAAAAAC TTAATTCC AC TTGAAAAGACT TACTTAGCTC C AATT CGTATCTAT TCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAAT 
AC ATAC ATTGAG C AAGC TAATTTAAAACC TTCAGATGC TATCT TTGTTGAGAAATCAGAT AAAAATTC AAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
C ACACAGATGAAGTGAAAAAAGTTAT C AAAGAT ACTT C AGC TGATATTC CACAATGG 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 

SEQ ID NO. 2906: SAG1641 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AAGTAAAGTTGTTAAAGTTGGTGT TATGACC TT TTCTGAC ACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGT 
AGGCGATAAAGC TAAAAT CAAATT C ACAGAATTTAC AGATTATACAC AACCAAATC AAGCGACAGCCAATAAGGATGT 
GGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAAGAAAAACTTAATTCCACTTGA 
AAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAAATTGAAA2\AAGGAGCCACTAT 
TGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGT 
TTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAAGATATTAATATTCAGGAGTTAGATGCGAG 
TCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACATTGAGCAAGCTAATTTAAAACC 
TTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTG 
GAAAAAGCAAAAGAACGC TAAAGCTATC CAAGCTATCT TGGATGC TTAT CACACAGATGAAGTGAAAAAAGTT ATC AA 
AGATACTTC AGCTGAT ATT C C AC AATGGAA 

SEQ ID NO- 2907: SAG1641 FROM THE COH1 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTG 
GGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCA 
AGCGACAGCCAATAAGGATGTGGATATTAATGCC TTTCAACATTACAATTTC TTAGAAAAC TGGAATAAGGAAAATAA 
GAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAA 
ATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTACTTCAGTCAGC 
AGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAA 
TATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACAT 
TGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATAT 
CATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGA 
TGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2908: SAG1641 FROM THE H36b GBS TYPE lb STRAIN 

AAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCAC 
GTTGGGATAAAATTGAAAAGC TAGTAGGTGAT AAAGCTAAAAT CAAATTTAC AGAAT TTACAGATTATACACAAC CAA 
ATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAA 
ATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTA 
AAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGT 
CAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATA 
TTAATATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACAT 
ACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTA 
ATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGC TATC C AAGCTAT CTTGGATGCTTATCACA 
CAGATGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2909: SAG1641 FROM THE JM3190013 GBS TYPE VIII STRAIN 

TTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTGGGA 
TAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGC 
GACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAAGAA 
AAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAAATT 
GAAAAAAGGAGC CACTATTGC AATTC CAAATGATGC AACAAATGGTAGCCGTGCATTGTATGTC CTT CAGTC AGCAGG 
TTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATAT 
TCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACATTGA 
GC AAGCTAATTTAAAAC C TTCAGATGC TAT CTT TGTTGAGAAATC AGATAAAAATTCAAAACAATGGATTAATATCAT 
TGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGA 
AGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ ID NO. 2 910: SAG1641 FROM THE M732 GBS TYPE III STRAIN 

AATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAA 
GCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAA 
CCAAATCAAGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAG 
GAAAATAAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCT 
CTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTT 
CAGTCAGCAGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAG 
GATATTAATAT T CAGGAGTTAGATGCGAGTC AAACA C C ACGTG C ACTC AAAGATGTAGATGCAGCTATTATTAATAAT 
ACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAA7U\TTCAAAACAATGG 
ATTAATATCATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTAT 
CACACAGATGAAGTGAAAAAAGTTATCAAAGATAC 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 



SEQ ID NO. 2911: SAG1641 FROM THE M781 GBS TYPE III STRAIN 

AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACCTTTTCTGACACTGAAAAAGCACGTTG 
GGATAAAATTGAAAAGCTAGTAGGTGATAAAGCTAAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCA 
AGCGACAGCCAATAAGGATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAATAA 
GAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAGAAGGTAAAATCTCTTAAAAA 
ATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCAACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGC 
AGGTTTAATCAAATTGAATGTTTCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAA 
TATTCAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATTAATAATACATACAT 
TGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAATCAGATAAAAATTCAAAACAATGGATTAATAT 
CATTGCGGGACGTAAAAATTGGAAAAAGCAAAAGAACGCTAAAGCTATCCAAGCTATCTGGGATGCTTATCACACAGA 
TGAAGTGAAAAAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGG 

SEQ2 901 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 
SEQ2 902 ATCAAGAAGTTTCAGCAAGCTCi^CTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 
SEQ2 903 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 
SEQ2 904 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 
SEQ2 905 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2 906 AAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2 907 AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2 908 AAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2909 TTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2 910 ATCAAGAAGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 
SEQ2 911 - - ■ AGTTTCAGCAAGCTCAACTTCAAGTAAAGTTGTTAAAGTTGGTGTTATGACC 

SEQ2 901 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCT 

SEQ2 902 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 903 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 904 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 905 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGA2UUVGCTAGTAGGTGATAAAGCT 

SEQ2 906 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGCGATAAAGCT 

SEQ 2 907 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 908 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 909 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 9 10 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 911 TTTTCTGACACTGAAAAAGCACGTTGGGATAAAATTGAAAAGCTAGTAGGTGATAAAGCT 

SEQ2 901 AAAATCAAATTCACAGAATTTACAGATTATACACAACCAAAT CAAGCGACAGCCAATAAG 

SEQ2 902 AAAATCAAATTTACAGAATTTACAGATTATACA 

SEQ2 903 AAAATCAAATTTACAGAATTTACAC^TTATACACAACCAAATC^AGCGACAGCCAATAAG 

SEQ2 904 AAAAT CAAAT T T ACAG AATTTACAGAT T AT ACACAACCAAATCAAGCGACAGC CAATAAG 

SEQ2 905 AAAATCAAATTTACAGAATTTACAGATTATACAC^CC^VATCAAGCGAC^GCC^TAAG 

SEQ2 906 AAAATCAAATTCACAGAATTTACAGATTATACACMCaVAATC^GCGACAGCCAATAAG 

SEQ2 907 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATGAAGCGACAGCCAATA 

SEQ2 908 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGACAGCCAATAAG 

SEQ2 909 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATGAA 

SEQ2 910 AAAAT CAAAT TTACAGAAT TTACAGAT TAT ACACAACCAAAT CAAGCGACAGCCAATAAG 

SEQ2 911 AAAATCAAATTTACAGAATTTACAGATTATACACAACCAAATCAAGCGAC^GCCAATAAG 

SEQ2 901 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 902 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 903 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2904 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 905 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 906 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ 2 907 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 908 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 909 GATGTGGATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGG-AATAAGGAAAAT 

SEQ2 910 GATGTGGATATTAATGCCTTTCAAC^TTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 911 GATGTGGtATATTAATGCCTTTCAACATTACAATTTCTTAGAAAACTGGAATAAGGAAAAT 

SEQ2 901 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAG 

SEQ2 902 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 

SEQ2 903 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG * 

SEQ2 904 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 

SEQ2 905 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 

SEQ2 906 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCCCCAATTCGTATCTATTCTGAG 

SEQ2 907 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 

SEQ2 908 AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 



SEQ2909 
SEQ2 910 
SEQ2911 

SEQ2901 
SEQ2902 
SEQ2903 
SEQ2904 
SEQ2905 
SEQ290 6 
SEQ2907 
SEQ2908 
SEQ2909 
SEQ2910 
SEQ2911 

SEQ2901 
SEQ2902 
SEQ2903 
SEQ2904 
SEQ2905 
SEQ2 906 
SEQ2907 
SEQ290 8 
SEQ2 909 
SEQ2910 
SEQ2911 

SEQ2 901 
SEQ2902 
SEQ2903 
SEQ2904 
SEQ2905 
SEQ2906 
SEQ2907 
SEQ290 8 
SEQ290 9 
SEQ2910 
SEQ2911 

SEQ2901 
SEQ2902 
SEQ2903 
SEQ2904 
SEQ2905 
SEQ290 6 
SEQ2907 
SEQ2908 
SEQ2 90 9 
SEQ2 910 
SEQ2911 

SEQ290X 
SEQ2902 
SEQ2903 
SEQ2904 
SEQ2905 
SEQ2 906 
SEQ2907 
SEQ2 90 8 
SEQ29 0 9 
SEQ2910 
SEQ2911 



AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 
AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 
AAGAAAAACTTAATTCCACTTGAAAAGACTTACTTAGCTCCAATTCGTATCTATTCTGAG 

AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCC^CTATTGCAATTCGAAATGATG^ 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCC^CTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGC^^TTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAAT TGAAAAAAGGAGCCACTATTGCAATT CCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 
AAGGTAAAATCTCTTAAAAAATTGAAAAAAGGAGCCACTATTGCAATTCCAAATGATGCA 

ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTACTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
ACAAATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
AC^AATGGTAGCCGTGCATTGTATGTCCTTCAGTCAGCAGGTTTAATCAAATTGAATGTT 
AC^AATGGTAGCCGTGC^TTGTATGTCCTTCAGTCAGCAGGTTTAATaUlATTGAATGTT 

TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAAGATATTAATATT 
TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT 
T CTGGTAAGAAGGT TG GAACAGTTG C TAATATCACAT C TAATAAAAAGGATATTAATAT T 
TCTGGTAAGAAGGTTGCAACAGTTGCTAATATGACATCTAATAAAAAGGATATTAATATT 
T CT GGTAAGAAGGT TGCAACAGTTGC TAATATCACAT CTAATAAAAAGGAT AT T AAT AT T 
TC TGGT AAGAAGGT TG CAACAGT T GC TAATAT CA CAT C TAATAAAAAAGATAT T AATAT T 
TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT 
T CTGGTAAGAAGGT TG CAA CAGT TGC TAATATCACAT CTAATAAAAAGGATAT TAATAT T 
TC T GGTAAGAAGGT TG CAA CAGT T GC TAATATCACAT C TAATAAAAAGGATATTAATAT T 
TCTGGTAAGAAGGTTGCAACAGTTGCTAATATCACATCTAATAAAAAGGATATTAATATT 
T CT GGTAAGAAGGT TG CAA CAGT T GC TAATAT CACATCTAATAAAAAGGATAT TAATAT T 

CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

CAGGAGTTAGATGCGAGTC^^CTICCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

(^GGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

CAGGAGTTAGATGCCAGTCAAACAC CACGTGCACT CAAAGATGTAGATGCAGCTATTAT T 

CAGGAGTTAGATGCGAGTCAAACACCACGTGC^CTCAAAGATGTAGATGCAGCTATTATT 

C^GGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

(^GGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATG 

CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTA 

CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

CAGGAGTTAGATGCGAGTCAAACACCACGTGCACTCAAAGATGTAGATGCAGCTATTATT 

AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AAT AAT ACAT ACAT T GAGCAAGCT AAT T T AAAAC CT T CAG ATG CT AT CT T TGT T GAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACAT TGAGCAAG CTAAT TTAAAACCTTCAGATG CTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATAC^TACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
AATAATACATACATTGAGCAAGCTAATTTAAAACCTTCAGATGCTATCTTTGTTGAGAAA 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 



SEQ2901 
SEQ2902 
SEQ2 90 3 
SEQ2904 
SEQ2 905 
SEQ2 90 6 
SEQ2907 
SEQ2908 
SEQ2 90 9 
SEQ2 910 
SEQ2911 

SEQ2901 
SEQ2 902 
SEQ2 903 
SEQ2 904 
SEQ2905 
SEQ2906 
SEQ2907 
SEQ2 90 8 
SEQ2909 
SEQ2 910 
SEQ2 911 

SEQ2901 
SEQ2902 
SEQ2 90 3 
SEQ2904 
SEQ2905 
SEQ2906 
SEQ2907 
SEQ290 8 
SEQ2909 
SEQ2910 
SEQ2911 



TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

T CAGAT AAAAAT T CAAAACAAT GG ATTAAT AT CAT T G CGG GACGTAAAAAT TGGAAAAAG 

TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

TCAGATAAAAATT CAAAACAAT GGATTAATAT CAT TGCGGGACGTAAAAAT TGGAAAAAG 

TCAGATAAAAATTC^AAACAATGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

TCAGATAAAAATTCAAAACAATGGATTAATATC^TTGCGGGACGTAAAAATTGGAAAAAG 

TCAGATAAAAATTCAAAACAATGGATTAATATCATTGCGGGACGTAA 

TCAGATAAAAATTCAAAAC^TGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

T CAGAT AAAAATTCAAAACAAT GGATTAATAT C^TTGCGGGACGTAAAAATTGGAAAAAG 

TCAGATAAAAATTCAAAAOy^TGGATTAATATCATTGCGGGACGTAAAAATTGGAAAAAG 

CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACA 

CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
OWAGAACGCTAAAGCTATCC^^GCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACAC^GATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTTGGATGCTTATCACACAGATGAAGTGAAA 
CAAAAGAACGCTAAAGCTATCCAAGCTATCTGGGATGCTTATCACACAGATGAAGTGAAA 

AAAGTTATCAAAGATACTTCAGCTGATATTCCACAATGGAACCCAGCTTTCTTGTACAA 

AAAGTTATGAAAGATACTTCAGCTGATATTCCACAATGG 

AAAGT TAT CAAAGATAC TT CAG C T GATATT C CAC 

AAAGT T AT CAAAGAT A C T T CAG C T GATATT CCACAATGG 

AAAGTTATCAAAGATACTTCAGCTGATATTCGACAATGG- 

AAAGTT AT CAAAGAT A C T T CAG C TGAT ATT C CACAATGGAA 

AAAGTTATCAAAGATACTTCAGCTGATATT CCACAATGG 

AAAGT TAT CAAAGATACTT CAG CT GATATT CCACAATGG 

AAAGTTATCAAAGATACTTCAGCTGATATT CCACAATGG 

AAAGTTATCAAAGATAC 

AAAGTTATCAAAGATACTTCAGCTGATATT CCACAATGG 



>SEQ ID NO 2950: 35_090 frame: 1 

NQEVSASSTSSKWKVGVMTFSDTEKARWDKI EKLVGDKAKI KFTEFTD YTQPNQATANK 
DVDINAFQHYNFLENWNKENKKNLIPI1EKTYI1APIR PNDA 
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRAIiKDVDAAII 
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK 
KVI KDTSADI PQWNPAFLY 



>SEQ ID NO 2951: 35_1169NT frame: 3 

QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKD 
VDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATI 
NGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIIN 
NT Y I EQANIiKP S DA I FVEKSDKNS KQWINI I AGRKNWKKQKNAKAI QAI LDAYHTDEVKK 
VIKDTSADIPQW 

>SEQ ID NO 2952: 35_18RS21 frame: 1 

NQEVSASSTS S KWKVGVMTFSDTEKARWDKI EKLVGDKAKIKFTEFTDYTQPNQATANK 
DVDINAFQHYNFLENWNKENKICNLIPLEKTYIiAPIRIYSEKVKSLKKLKKGATIAIPNDA 
TNGSRAL YVLQSAGLI KLNVSGKKVATVANITSNKKDINI QELDAS QTPRALKDVDAAI I 
NNTYIEQANLKPSDAI FVEKSDKNS KQWINI I AGRKNWKKQKNAKAI QAI LDAYHTDEVK 
KVI KDTSADI P 

>SEQ ID NO 2953 : 35_2 603 frame: 1 

NQEVSASSTSSKWKVGVMTFSDTEKARWDKI EKLVGDKAKIKFTEFTDYTQPNQATANK 
DVDINAFQHYNFLENWNKENKKNLI PLEKTYIiAPIRI YSEKVKSLKKLKKGATIAI PNDA 
TNGSRALWLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII 
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK 

KVIKDTSADIPQW 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 

>SEQ ID NO 2954:35_A909 frame: 1 

NQEVSASSTSSKWKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK 
DVDINAFQHYNFLBNWNKENKKNIilPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA 
TNGSRALYVLQSAGLI KLNVSGKKVATVANITSNKKDINI QELDASQTPRALKDVDAAI I 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
KVIKDTSADIPQW 

>SEQ ID NO 2955:35_CJB110 frame: 2 

SK3A7KVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVDINAFQHY 
NFLENWNKENKKtfLIPLEKTYI^ 

QSAGLI KLNVSGKKVATVANI TSNKKDINI QELDASQTPRALKDVDAAI INNTYIEQANL 
KPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAI QAILDAYHTDEVKKVI KDTSAD I 
PQW 

>SEQ ID NO 2 956:35_COHl frame: 2 

VSASSTSSKWKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANKDVD 
INAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNG 
SFJ^LYVLQSAGLIKLNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAIINNT 
YIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKVI 

KDTSAD I PQW 

>SEQ ID NO 2957:35_H36B frame: 3 

EVS AS S TS SKWKVGVMTFSDTEKARWDKI EKLVGDKAKI KFTEFTD YTQPNQATANKDV 
DINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKICLKKGATI 
GSRALYVLQSAGLIKLNVSGKKVATVANITSNKECD INI QELDASQTPRALKDVDAAI INN 
TYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVKKV 

IKDTS AD I PQW 

>SEQ ID NO 2958 : 35_JM9130013 frame: 2 

SAS STS SKVVKVGVMTFSDTEKARWDKIEKLVGDKAKI KFTEFTDYTQPNQATANKDVD I 
NAFQHYNFLENWNKENKKNLI PLEKTYLAP I RI YSEKVKSLKKLKKGATIAI PNDATNGS 
RAL YVLQSAGLIKLNVSGKKVATVANI TSNKKDINIQELDASQTPRALKDVDAAI INNT Y 
I EQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAI QAILDAYHTDEVKKVI K 
DTSADIPQW 

>SEQ ID NO 2959:35_M732 frame: 1 

NQEVSASSTSSKWKVGVMTFSDTEKARWDKI EKLVGDKAKI KFTEFTD YTQPNQATANK 
DVD INAFQHYNFLENWNKENKKNL I PLEKTYLAP I RIYSEKVKSLKKLKKGATIAIPNDA 
TNGSRAL YVLQSAGLIKLNVSGKKVATVANI TSNKKDINIQELDASQTPRALKDVDAAI I 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 

KVIKD 

>SEQ ID NO 2960:35_M781 frame: 2 

VSAS STS S KWKVGVMTFSDTE KARWDKI EKLVGDKAKI KFTE FTDYTQPNQATANKDVD 
INAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDATNG 
SRALYVLQSAGLIKLNVSGKKVATVANITSNKXDINIQELDASQTPRALKDVDAAIINNT 
YIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAIWDAYHTDEVKKVI 

KDTSAD I PQW 

SEQ2 950 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTD YTQPNQATANK 

SEQ2 951 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTD YTQPNQATANK 

SEQ2952 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTD YTQPNQATANK 

SEQ2953 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK 

SEQ2954 QEVSASSTSSKVVKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK 

SEQ2955 SKWKVGVMTFSDTEKARWDKI EKLVGDKAKIKFTEFTDYTQPNQATANK 

SEQ2 95 6 - -VSAS STSSKWKVGVMTFSDTEKARWDKIEKLVGDKAKI KFTEFTD YTQPNQATANK 

SEQ2957 -EVSASSTSSKVVKVGWTFSDTEKARWDKI EKLVGDKAKI KFTEFTD YTQPNQATANK 

SEQ2 958 SASSTSSKWKVGVMTFSDTEKARWDKIEKLVGDKAKI KFTEFTD YTQPNQATANK 

SEQ2 959 QE^ SAS STS SKWKVGVMTFSDTEKARWDKI EKLVGDKAKI KFTEFTD YTQPNQATANK 

SEQ2 960 - -VSASSTSSKWKVGVMTFSDTEKARWDKIEKLVGDKAKIKFTEFTDYTQPNQATANK 

SEQ2 950 DVD INAFQHYNFLENWNKENKKNL I PLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA 

SEQ2 951 DVD INAFQHYNFLENWNKENKKNL I PLEKTYLAP I RI YSEKVKSLKKLKKGATIAI PNDA 

SEQ2952 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA 

SEQ2953 DVDINAFQHYNFLENWNKENKKNLIPLEKTYIxAPIRI YSEKVKSLKKLKKGATIAI PNDA 

SEQ2 954 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA 

SEQ2955 ' DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRIYSEKVKSLKKLKKGATIAIPNDA 

SEQ295 6 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPIRI YSEKVKSLKKLKKGATIAIPNDA 

SEQ2957 DVD INAFQHYNFLENWNKENKKNLIPLEKTYLAPIRI YSEKVKSLKKLKKGATIAI PNDA 

SEQ2958 DVDINAFQHYNFLENWNKENKKNLIPLEKTYLAPI RIYSEKVKSLKKLKKGATIAIPNDA 
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Table 29: Comparative Sequences relating to SAG1641 (YaeC family protein) 



SEQ2959 
SEQ2960 

SEQ2 95 0 
SEQ2951 
SEQ2952 
SEQ2 953 
SEQ2954 
SEQ2955 
SEQ2956 
SEQ2957 
SEQ2 95 8 
SEQ2959 
SEQ2960 

SEQ2950 
SEQ2951 
SEQ2 952 
SEQ2 953 
SEQ2 954 
SEQ2955 
SEQ2 956 
SEQ2957 
SEQ2958 
SEQ2959 
SEQ2960 



DVDINAFQHYNFLENWNKEmoaSTLIPLEKTYr^PIRIYSEKVKSLKKLKKGATIAIPNDA 
DVDINAFQHyNFLENWNKENKKNIilPLEKTYIiAPIRIYSEKV 

TNGSRAL YVLQ S AGL I KLNVSGKKVATVAN I TSNKKDI NI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLIKIiNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII 
TNGSRALYVLQSAGLIKLNVSGKKVATVANITSNKKDIN^ 

TNGSRALYVLQSAGLI KLNVSGKKVATVAN I TSNKKDI NI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLIKIjNVSGKKVATV^ 

TNGSRALYVLQSAGLI KLNVSGKKVATVAN I TSNKKDINI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLIKIiNVSGKKVATVANITSNKKDINIQELDASQTPRALKDVDAAII 
TNGSRALYVLQSAGLI KLNVSGKKVATVANI TSNKKDINI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLI KLNVSGKKVATVAN I TSNKKDINI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLI KLNVSGKKVATVANI TSNKKDINI QELDAS QTPRALKDVDAAI I 
TNGSRALYVLQSAGLI KliNVSGKKVATVANI TSNKKDINI QELDAS QTPRALKDVDAAI I 

NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLK^SDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYI EQANLKPSDAI FVEKSDKNSKQWINI I AGRKNWKKQKNAKAI QAILDAYHTDEVK 
NNT YI EQANLKPSDAI FVEKSDKNSKQWINI I AGRKNWKKQKNAKAI QAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAIFVEKSDKNSKQWINIIAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI IAGRKNWKKQKNAKAIQAILDAYHTDEVK 
NNTYIEQANLKPSDAI FVEKSDKNSKQWINI I AGRKNWKKQKNAKA I QAIWDAYHTDEVK 



SEQ2950 
SEQ2951 
SEQ2952 
SEQ2953 
SEQ2954 
SEQ2955 
SEQ2956 
SEQ2957 
SEQ2958 
SEQ2959 
SEQ2960 



KVI KDTS ADI PQWNPAFLY 

KVIKDTSADIPQW 

KVI KDTSAD I P 

KVIKDTSADIPQW 

KVIKDTSADIPQW 

KVIKDTSADIPQW 

KVIKDTSADIPQW 

KVIKDTSADIPQW 

KVIKDTSADIPQW 

KVIKD 

KVIKDTSADIPQW 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

SEQ ID NO. 3001: SAG2147 FROM THE 1169NT1 GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCC 

AAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCT 

CCAAAACCTTCTCAGGCATCTAATGAAGTCCCAAAATCAAGTTCTCAATCTACAGAAGCT 

AATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACA 

GAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTAC 

AAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAGGGGCG 

GTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGG 

GAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCT 

TCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAGTT 

AATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO- 3002: SAG2147 FROM THE 18RS21.GBS TYPE II STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTT CACAAGTTAC TAC TGAATCTTTGTCAAAAGC AGATAAAGTT C 
GCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAA 
AACAGGC TC CAAAACCTT CT CAGGC ATCTAATGAAGC CCCAAAAT C AAGTTCTC AATCTA 
C AGAAGCTAATT CT CAGCAACAAGTTAC TGCGAGTGAAGAGGCAGCTGTAGAAC AAGCAG 
TTGTAACAGAAAAC AC CCCTGC TACCAGTC AGGC ACAACAAGCT TATGCTGTTACTGAGA 
CAACTTATAGACCTGCTCAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATACTG 
CAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGT 
CTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCT 
C AGGAGCTT C AGGACTTTTC C AAACGATGC CAGGTTGGGGTTCAACAG CTACAGTT C AGG 
ATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO. 3003: SAG2147 FROM THE 2603 V/R GBS TYPE V STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGT 

TCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGT 

AAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATC 

TACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGC 

AGTTGTAAC AGAAAACACCCC TGCTAC C AGT CAGGC AC AACAAG CT TATGCTGTTACTG A 

GACAACTTATAGACCTGCTCAACACCAGACGAGTGGCCAAGTATTGAGTAATGGAAATAC 

TGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCA 

GTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGC 

CTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCA 

GGATCAAGTTAAT TC AGC TAT TAAAGC TTATCGTGCT CAAGGTTTATCAGCTTGGGGTTA 

C 

SEQ ID NO. 3004: SAG2147 FROM THE 090 GBS TYPE la STRAIN 
(REVERSE COMPLEMENT) 

TAGCCAAAAAATCAAAAATGATTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAAC 
AGGC TCCAAAAC CTTCT CAGG CAT CTAATGAAGC C CCAAAATCAAGTTC TCAATCTACAG 
AAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTG 
TAACAGAAAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAA 
CTTATAGACCTG C T C AACACC AGACGAGTGGCCAAGTATTG AGT AATGGAAAT ACTGC AG 
GGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTA 
CTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAG 
GAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGA 

SEQ ID NO. 3005: SAG2147 FROM THE A909 GBS TYPE la STRAIN 
(REVERSE COMPLEMENT) 

AAGGCGAC ATC T AAAT CAAAAGTAGAAG ATGTAAAAC AGGCTCC AAAAC CTTCT CAGGC A 
TCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTT 
ACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACC 
AGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG 
ACAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCA 
GCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGT 
GAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACG 
ATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCT 
TATCGTGCTCAAGGTTTATCA 



651 



WO 2004/018646 



PCT/US2003/026827 



Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

SEQ ID NO. 3006: SAG2147 FROM THE CJB110 GBS NONTYPEABLE STRAIN 
(REVERSE COMPLEMENT) 

AATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGA 
CATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATG 
AAGCCCCAAAAT CAAGTTC TCAAT CTAC AGAAGC TAATTC TCAGC AAC AAGTTAC TGCGA 
GTGAAGAGGCAGC TGTAGAAC AAGCAGTTGTAACAGAAAACAC CC CTGCTAC CAGTCAGG 
CACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAGACGAGTG 
GC CAAGTATTGAGTAATGGAAATACTGC AGGGGC TATTGGCT CAGCAG C TGC AGCAC AAA 
TGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAA 
ATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAG 
GTTGGGGTTCAACAGCTACAGTTCAGGATCAAGTTAATTCAGCTATTAAAGCTTATCGTG 
CTCAAGGTTTATCAGCTTGGGGTTAC 

SEQ ID NO, 3007: SAG2147 PROM THE COH1 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAA 

AGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGA 
TGTAAAACAGGCTCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCA 
AT C T ACAGAAG C T AATT C T C AG C AAC AAGT TA C TG C G AGTGAAGAGG C GG C TGTAGAAC A 

AGCAGTTGTAAC AGAAAATACC CCTGC TACCAGTCAGGCACAACAAAC TTATGCTGTTAC 
TGAGACAACTTACAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAA 
TACTGCAGGGGCGGTCGGATCTGCTGC^GCAGCACAAATGGCTGCTGCAACAGGAGTCCC 
TCAGTCTACTTGGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAA 
TGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGT 
TCAGGATCAAGTTAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGG 
TTAC 

SEQ ID NO. 3008: SAG2147 FROM THE H36b GBS TYPE lb STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTT C ACAAGTTAC TAC TGAATCTTTGTC AAAAGC 

AGATAAAGTTCGCGTAGCCAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGT 
AGAAGATGTAAAACAGGCTC C AAAAC CTTCTCAGGC AT CTAATGAAGC CCC AAAATC AAG 
TTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGT 
AGAACAAGC AGTTGTAACAGAAAACAC C C C TGCTAC C AGT CAGGC ACAACAAGCTTATGC 
TGTTACTGAGACAACTTATAGACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGTAA 
TGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACAGG 
AGTCCC TC AGTC TACTTGGGAAC ATATTATTGCC CGTGAATCAAATGGTAATCCTAATGT 
TGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGC 
TACAGTTCAGGATCAAGTTAATTCAGCTATTAAAGCTT 

SEQ ID NO. 3009: SAG2147 FROM THE M732 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

AAAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGC 

CAAAAAATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGC 

TCCAAAACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGC 

TAATTCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAAC 

AGAAAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA 

CAAACCTGCTCAACACCAGACAAGTGGCCAAGTATTGAGCAATGGAAATACTGCAGGGGC 

GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTG 

GGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGC 

TTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGATCAAGT 

TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTA 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

SEQ ID NO. 3010: SAG2147 FROM THE M781 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GTAACCC C AAGC TGATAAACCTTGAGCACGATAAGC T TTAATAGCTGAATTAACTTGAT C 
CTGAACTGTAGCTGTTGAACCCCAACCTGGCATCGTTTGGAAAAGTCCTGAAGCTCCTGA 
GGCATTAGCAACATTAGGATTACCATTTGATTCACGGGCAATAATATGTTCCCAAGTAGA 
CTGAGGGACTCCTGTTGCAGCAGCCATTTGTGCTGCAGCAGCAGATCCGACCGCCCCTGC 
AGTATTTCCATTGCTCAATACTTGGCCACTTGTCTGGTGTTGAGCAGGTTTGTAAGTTGT 
CTCAGTAACAGCATAAGTTTGTTGTGCCTGACTGGTAGCAGGGGTATTTTCTGTTACAAC 
TGCTTGTTCTACAGCCGCCTCTTCACTCGCAGTAACTTGTTGCTGAGAATTAGCTTCTGT 
AGATTGAGAACTTGATTTTGGGGCTTCATTAGATGCCTGAGAAGGTTTTGGAGCCTGTTT 
TACATCTTCTACTTTTGATTTAGATGTCGCCTTAGTCATTTTTGATTTTTTGGCTACGCG 
AACTTTATCTGCTTTTGACAAAGA 



SEQ3001 
SEQ3 002 
SEQ3003 
SEQ3004 
SEQ3005 
SEQ3007 
SEQ3008 
SEQ300 9 
SEQ3010 



AGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCAAAACCTTCTCAGGCA 



SEQ3001 
SEQ3002 
SEQ3 00 3 
SEQ3004 
SEQ3005 
SEQ3 007 
SEQ3 00 8 
SEQ3009 
SEQ3010 



CTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAATTCTCAGCAACAAGTT 



SEQ3001 
SEQ3002 
SEQ3 003 
SEQ3004 
SEQ3005 
SEQ3007 
SEQ3008 
SEQ3009 
SEQ3010 



CTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGAAAACACCCCTGCTACC 



SEQ3 001 
SEQ3 0 02 
SEQ3003 
SEQ3004 
SEQ3 00 5 
SEQ3007 
SEQ3008 
SEQ30 09 
SEQ3010 



GTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTATAGACCTGCTCAACACCAG 



SEQ3001 
SEQ30G2 
SEQ3003 
SEQ3004 
SEQ3005 
SEQ3007 
SEQ3 008 
SEQ3009 
SEQ3010 



CAAGTGGCCAAGTATTGAGTAATGGAAATACTGCAGGGGCTATTGGCTCAGCAGCTGCA 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

SEQ3001 

SEQ3002 

SEQ3003 

SEQ3004 

SEQ3 005 CACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTTGGGAACATATTATTGCCCGT 

SEQ3007 

SEQ3008 

SEQ3009 

SEQ3010 

SEQ3001 

SEQ3002 

SEQ3003 

SEQ3004 

SEQ3 005 AATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAGCTTCAGGACTTTTCCAAACG 

SEQ3007 

SEQ3008 

SEQ3009 

SEQ3010 



SEQ3001 

SEQ3002 

SEQ3003 

SEQ3004 

SEQ3 005 TGCCAGGTTGGGGTTCAACAGCTACAGTTCAGAATCAAGTTAATTCAGCTATTAAAGCT 

SEQ3007 

SEQ3008 

SEQ3009 

SEQ3010 

SEQ3 001 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA 

• SEQ3 002 AAAGTTC^CAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3003 AAAGTT CACAAGTTACTACTGAAT CTTTGT CAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3004 TAGCCAAA 

SEQ3 005 ATCGTGCTCAAGGTTTATCASAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3 007 AAAGTT CACAAGTTACTACTGAAT CTTTGT CAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3 008 AAAGTTCACAAGTTACTACTGAAT CTTTGT CAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3 009 AAAGTTCACAAGTTACTACTGAATCTTTGTCAAAAGCAGATAAAGTTCGCGTAGCCAAA 

SEQ3 010 GTAACCCCAAGCTGA TAAACCTTGAGCACGATAAGCTTTAATAGCTGAA 

SEQ3 001 AATCAAAAATGACTAAGGCGA<^TCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 002 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAA 

SEQ3 003 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 004 AATCAAAAATGATTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 005 AATCAAAAATGACTAAGGCGACATCTAAATCAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 007 AAT CAAAAATGACTAAGGCGACATCTAAAT CAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 008 AATCAAAAATGACTAAGGCGACAT CTAAAT CAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3 009 AATCAAAAATGACTAAGGCGACAT CTAAAT CAAAAGTAGAAGATGTAAAACAGGCTCCA 

SEQ3010 TAACTTGATCCTGAACTGTAGCTGTTGAACCCCAACCTGGCATCGTTTGGAAAAGTCCT 

SEQ3 001 AACCTTCTCAGGCATCTAATGAAGTCCC^AAATCAAGTTCTCAATCTACAGAAGCTAAT 

SEQ3 002 AACCTT CTCAGGCAT CTAATGAAGCC C CAAAAT CAAGTTCTCAATC TACAGAAG CTAAT 

SEQ3 003 AACCTTCT(^GGCATCTAATGAAGCCCa\AAATCAAGTTCTCMTCTACAGAAGCTAAT 

SEQ3004 AACCTTCTCAGGC^TCTAATGAAGCCCCAAAATCMGTTCTCAATCTACAGAAGCTAAT 

SEQ3 005 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTC^ 

SEQ3 007 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATC7UVGTTCTCAATCTACAGAAGCTAAT 

SEQ3 008 AACCTTCTCAGGCATCTAATGAAGCCCCAAAATCAAGTTCTCAATCTACAGAAGCTAAT 

SEQ3 009 AACCTTCTCAGGCATCTAATGAAGCCCCTU^TCAAGTTCTCAATCTAC^GAAGCTAAT 

SEQ3 010 AAGCTCCTGAGGCATT AGCAACATTAGGATTAC - CATTTGATTCACGGGCAATAAT 

SEQ3 001 TCTCAGC^CAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAAC^GA 

SEQ3002 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGC^ 

SEQ3 003 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGT 

SEQ3 004 TCTCAGCAACAAGTTA CTG CGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA 

SEQ3 005 TCT CAGCAACAAGTTACTG CGAGTGAAGAGGCAGC TGTAGAACAAGCAGTTGTAACAGA 

SEQ3 007 TCT CAGCAACAAGTTACTG CGAGTGAAGAGGCGGC TGTAGAACAAGCAGTTGTAACAGA 

SEQ3 008 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCAGCTGTAGAACAAGCAGTTGTAACAGA 

SEQ3 009 TCTCAGCAACAAGTTACTGCGAGTGAAGAGGCGGCTGTAGAACAAGCAGTTGTAACAGA 

SEQ3 010 TGTTCCC^GTAGACTGAGGGACTCCTGTTGCAGC^GC(2ATTTGTGCTGCAGCAGCAGA 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

SEQ3 001 - -AAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGT^ 

SKQ3 0 0 2 - - AAA(^CCCCTGCTACCAGT(^GGCACAAC^GCTTATGCTGTTACTGAGACAACTTA 

SEQ3 003 - -AAACACCCCTGCTAC(^GTCAGGCACAACAAGCTTATGCTGTTACTGAGAC!AACTTA 

SEQ3 004 - -AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA 

SEQ3 005 - - AAACACCCCTGCTACCAGTCAGGCAC^C^GCTTATGCTGTTACTGAGAC^CTTA 

SEQ3 007 - - AAATACCCCTGCTACC^GTC^GGCACAAC^AACTTATGCTGTTACTGAGACAACTTA 

SEQ3 008 - - AAACACCCCTGCTACCAGTCAGGCACAACAAGCTTATGCTGTTACTGAGACAACTTA 

SEQ3 009 - -AAATACCCCTGCTACCAGTCAGGCACAACAAACTTATGCTGTTACTGAGACAACTTA 

SEQ3010 CCGACCGCCCCTGCAGTATTTCCATTGCTCAATACTTG-GCCACTTGTCTGGTGTTGAG 



SEQ30 0 1 AAACCTGCTCAACACCAGACAAGTGGC - CAAGTATTGAGCAATGGAAATACTGCAGGGG 

SEQ3 002 AGACCTGCTCAACACCAGACGAGTGGC - CAAGTATTGAGTAATGGAAATACTGCAGGGG 

SEQ3 003 AGACCTGCTCAACACCAGACGAGTGGC - CAAGTATTGAGTAATGGAAATACTGCAGGGG 

SEQ3 004 AGACCTGCTCAACACCAGACGAGTGGC - CAAGTATTGAGTAATGGAAATACTGCAGGGG 

SEQ3 005 AGACCTGCTCAACACCAGACGAGTGGC - CAAGTATTGAGTAATGGAAATACTGCAGGGG 

SEQ3 007 AAACCTGCTCAACACCAGACAAGTGGC - CAAGTATTGAGCAATGGAAATACTGCAGGGG 

SEQ3 008 AGACCTGCTCAACACCAGACAAGTGGC - CAAGTATTGAGTAATGGAAATACTGCAGGGG 

SEQ3009 AAACCTGCTCAACACCAGACAAGTGGC - CAAGTATTGAGCAATGGAAATACTGCAGGGG 

SEQ3 010 AGGTTTGTAAGTTGTCTCAGTAACAGCATAAGTTTGTTGTGCCTGACTGGTAGCAGGGG 

SEQ3 00 1 GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT 

SEQ3 002 TATTGGCTCAGC^GCTGCAGC^CAAATGGCTGCTGCAAC^GGAGTCCCTCAGTCTACTT 

SEQ3 003 TATTGGCTCAGCAGCTGC^GC^CAAATGGCTGCTGC^CAGGAGTCCCTCAGTCTACTT 

SEQ3 004 TATTGGCT(^GCAGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCT(^GTCTACTT 

SEQ3 005 TATTGGCTCAGCAGCTGCA-GCACAAATGGCTGCTGCAAC^ 

SEQ3 007 GGTCGGATCTGCTGCTGCAGCACAAATGGCTGCTGCAACAGGAGTCCCTCAGTCTACTT 

SEQ3 008 TATTGGCTCAGCAGCTGCAGCACAAATGGCTGCTGCAACA.GGAGTCCCTCAGTCTACTT 

SEQ3 009 GGTCGGATCTGCTGCTGC^G(^CAAATGGCTGCTGCAAC^GGAGTCCCTCAGTCTACTT 

SEQ3010 A - TTT- - TCTGTTACAACTGCTTGTTCTACAGCCGCCTCTTCACTCGCAGTAACTTGTT 



SEQ3 001 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG 

SEQ3 002 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG 

SEQ3 003 GGGAACATATTATTGC CCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCT CAGGAG 

SEQ3004 GGGAACATATTATTGCCCGTGAATCAAATGGTAAT CCTAATGTTGCTAATGCCT CAGGAG 

SEQ3 005 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCT CAGGAG 

SEQ3 007 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG 

SEQ3 008 GGGAACATATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG 

SEQ3 009 GGGAAC^TATTATTGCCCGTGAATCAAATGGTAATCCTAATGTTGCTAATGCCTCAGGAG 

SEQ3 010 GCTGAGA - ATTAGCTT CTGTAGATTGAG AA - - CTTGATTTTGGGGCTTCATTAGATG 

r 

SEQ3 001 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTAC^ 

SEQ3 002 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTGA^ 

SEQ3 003 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAAC^^ 

SEQ3 004 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGA 

SEQ3 005 CTTC^GGACTTTTCCAAACGATGCCA.GGTTGGGGTTCAACAGCTACAGTTCAGGAT 

SEQ3 007 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAAC^GCTACAGTTC^GGATC^G 

SEQ3 008 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGG^ 

SEQ3 009 CTTCAGGACTTTTCCAAACGATGCCAGGTTGGGGTTCAACAGCTACAGTTCAGGA 

SEQ3 0 1 0 CCTGAGAAGGTTTT GGAGCCTGTTTTACATCTTCTACTTTTGATTTAGATGTCGC 

SEQ3 001 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC - - 

SEQ3 002 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC- - 

SEQ3 0 0 3 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC - - 

SEQ3004 

SEQ3 005 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC- - 

SEQ3 007 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTAC- - 

SEQ3 008 TAATTCAGCTATTAAAGCTT 

SEQ3 009 TAATTCAGCTATTAAAGCTTATCGTGCTCAAGGTTTATCAGCTTGGGGTTA 

SEQ3010 TTAGTCA-TTTTTGATTTTTTGGCTACGCGAACTTTATCTGCTTTTGACAAAGA 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 

>SEQ ID NO 3050: 25_1169NT frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEVPKSSSQSTEAN 
S Q Q Q VT AS E EAAVE QA WT ENT PAT S QAQ Q T YAVTE TT Y K P AQHQTS GQVL S NGNTAGAV 
GSAAAAQMAAATGVPQS TWEHI IARESNGNPNVANAS GASGL F QTMPGWGS TATVQDQVN 
SA I KA YRAQGLS AWG Y 

>SEQ ID NO 3051:25_18RS21 frame: 1 

KSSQVTTESIjSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQ QVTASE EAAVEQAVVTENTPATS QAQQA YAVTETT YRPAQHQTS GQVLSNGNTAGAI 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPWANASGASGLFQTMPGWGSTATVQDQVN 
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3 052:25_2603 frame: 1 

KSSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI 
GSAAAAQMAAATGVPQSTWEHI I ARE S NGN PNVANA S GAS GLFQTMPGWGSTATVQDQVN 
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3053:25_090 frame: 3 

AKKS KM I KATSKSKVEDVKQAPKPSQASNEAPKS S SQS TEANS QQQVTAS EEAAVEQAW 
TENT PATS QAQ QAYAV TE T T YRPAQHQTS GQVL S NGNTAGA I G S AAAAQMAAATGV P Q S T 
WEHI I ARESNGNPNVANAS GAS GLFQTMPGWGSTATVQ 

>SEQ ID NO 3054:25_A909 frame: 1 

KATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTASEEAAVEQAWTENTPAT 
SQAQQA YAVTETT YRPAQHQTS GQVLSNGNTAGAI GS AAAAQMAAATGVPQS TWEH 1 1 AR 
ESNGNPNVANASGAS GL FQTMPGWGS TATVQNQVNSAI KA YRAQGL S 

>SEQ ID NO 3 055:25_CJB110 frame: 3 

SLSKADKVRVAKKS KMTKATSKS KVEDVKQAP KP S QAS NEAPKS S S QS TEANS Q Q Q VTA S 
EEAAVE QAWTENTPATS QAQQAYAVTETT YRPAQHQTS GQVL SNGNTAGAI GS AAAAQM 
AAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVNSAIKAYRA 
QGLSAWGY 

>SEQ ID NO 3056:25__COH1 frame: 1 

KSSQVTTESLSKADKVRVAKKS KMTKATSKS KVEDVKQAP KPSQASNEAPKSSSQSTEAN 
S Q QQ VT AS E EAAVE QAWTENT PAT S QAQQT YAVT E TTYKPAQHQTS GQVL S NGNTAGAV 

GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANAS GAS GLFQTMPGWGSTATVQDQVN 
SAIKAYRAQGLSAWGY 

>SEQ ID NO 3057:25_H36B frame: 1 

KSSQVTTESLSKADKVRVAKKS KMTKATSKS KVEDVKQAPKPSQASNEAPKSSSQSTEAN 
SQQQVTASEEAAVEQAWTENTPATSQAQQAYAVTETTYRPAQHQTSGQVLSNGNTAGAI 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANAS GAS GLFQTMPGWGSTATVQDQVN 
SAIKA 

>SEQ ID NO 3058:25_M732 frame: 1 

KSSQVTTESLSKADKVRVAKKS KMTKATSKS KVEDVKQAP KPSQASNEAPKSSSQSTEAN 
SQQQVTAS EEAAVEQAWTENTPATS QAQQT YAVTETT YKP AQHQTS GQVL SNGNTAGAV 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANASGAS GLFQTMPGWGSTATVQDQVN 
SAIKA YRAQGLS AWG 

>SEQ ID NO 3059:25_M781 frame: 4 

SLSKADKVRVAKKS KMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEANSQQQVTAS 
EEAAVEQA WTENT PATS QAQQT YAVTETT YKPAQHQTS GQVL SNGNTAGAVGS AAAAQM 
AAATGVPQ S TWEHI I ARES NGNPNVANASGAS GLFQTMPGWGS TATVQDQVNS AI KAYRA 
QGLSAWGY 



SEQ3050 SSQVTTESLSKADKVRVAKKS KMTKATSKS KVEDVKQAPKPSQASNEVPKSSSQSTEAN 

SEQ3051 SSQVTTESLSKADKTOVAKKS KMTKATSKS KVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ3052 SSQVTTESLSKADKVRVAKKSKMTKAlTSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ30S3 AKKS KMI KATSKSKVEDVKQAPKPSQASNEAPKS SSQSTEAN 

SEQ3 054 KATSKSKVEDVKQAPKPSQASNEAPKS SSQSTEAN 

SEQ3055 SLSKADKVRVAKKS KMTKATSKS KVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ3 05 6 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ3 057 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ3 058 SSQVTTESLSKADKVRVAKKSKMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 

SEQ3059 SLSKADKVRVAKKS KMTKATSKSKVEDVKQAPKPSQASNEAPKSSSQSTEAN 
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Table 30: Comparative Sequences relating to SAG2147 
(protein of uknown function / lipoprotein, putative) 



SEQ3 05 0 
SEQ3051 
SEQ3 052 
SEQ3053 
SEQ3054 
SEQ3 055 
SEQ305 6 
SEQ3057 
SEQ3058 
SEQ3059 

SEQ3050 
SEQ3 051 
SEQ3052 
SEQ3053 
SEQ3054 
SEQ305 5 
SEQ3056 
SEQ3057 
SEQ3058 
SEQ3059 

SEQ3 05 0 
SEQ3051 
SEQ3052 
SEQ3053 
SEQ3054 
SEQ3055 
SEQ3 056 
SEQ3057 
SEQ3 05 8 
SEQ3059 



S QQQVTAS EEAAVEQAWTENTPATS QAQQTYAVTETT YKPAQHQT S GQVL SNGNTAGAV 
S Q Q Q VTAS E EAA VEQ AWT ENT PAT S Q AQ Q A YAVTE TT YR P AQHQT S GQ VL SNGNTAGA I 
S Q Q Q VTA S EEAAVEQAWT ENT PAT S Q AQ Q AYAVTE T T YR P AQHQ T S GQVL S NGNTAGA I 
S QQQVTAS EEAAVEQAWTENT PATSQAQQAYAWETT YRPAQHQTSGQVLSNGNTAGAI 
S Q Q QVT A S E EAAVE QA WTENT P ATS Q AQ QA YAVTE TTYR PAQHQ T S GQ VL SNGNTAGA I 
SQQQVTASEEAAVEQAVVTENTPATSQAQQAYAVTETTYRPAQHQTSGQVIjSNGNTAGAI 
SQQQVTASEEAAVEQAVVTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV 
SQQQVTAS EEAAVEQAVVTENTPATSQAQQAYAVTETT YRPAQHQTSGQVLSNGNTAGA I 
SQQQVTASEEAAVEQAWTENTPATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV 
SQQQVTAS EEAAVEQAWT ENT PATSQAQQTYAVTETTYKPAQHQTSGQVLSNGNTAGAV 

GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
GSAAAAQMAAATGVPQS TWEHI I ARESNGNPNVANASGAS GLFQTMPGWGS TATVQDQVN 
GSAAAAQMAAATGVPQS TWEHI I ARESNGNPNVANAS GAS GLFQTMPGWGS TATVQDQVN 

GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGLFQTMPGWGSTATVQ 

GSAAAAQMAAATGVPQSTWEHIIARESNGNPNVANASGASGIiFQTMPGWGSTATVQNQVN 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANASGAS GLFQTMPGWGS TATVQDQVN 
GSAAAAQMAAATGVPQS TWEHI I ARESNGNPNVANASGAS GLFQTMPGWGS TATVQDQVN 
GSAAAAQMAAATGVPQSTWEHI IARESNGNPNVANASGASGLFQTMPGWGSTATVQDQVN 
GSAAAAQMAAATGVPQS TWEHI I ARE SNGNPNVANASGAS GLFQTMPGWGS TATVQDQVN 
GSAAAAQMAAATGVPQS TWEHI I ARESNGNPNVANASGAS GLFQTMPGWGS TATVQDQVN 

AI KAYRAQGLS AWGY 
AI KAYRAQGLS AWGY 
AIKAYRAQGLSAWGY 



AI KAYRAQGLS 

AI KAYRAQGLSAWG Y 
AIKAYRAQGLSAWGY 

AIKA 

AI KAYRAQGLSAWG - 
AIKAYRAQGLSAWGY 
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Table 31: Comparative Sequences relating to SAG2148 

(LysM domain protein) 

SEQ ID NO. 3101: SAG2148 FROM THE 1169NT1 GBS TYPE V STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCA 
AAAGAAGAAATAGCTCGTCGTGAATC AAATGGTAGTTATAC TGC ACAGAATGGACAATAT TATGGAAG ATAT C AACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3102: SAG2148 FROM THE 18RS21 GBS TYPE II STRAIN 

GCATCTTATAC CGTGAAAT CAGGTGATAC CTTATCAGC TATTGCTAAAAATC AT AAAACT ACGGTACAAGAGTTAGTG 
T CT CTC AATAGTATC AGTAACGCTGATGTC ATC AGTATAGGTGATGTTTTAAAATTGGAT AATTCTAC AG C TAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3103: SAG2148 FROM THE 2603 V/R GBS TYPE V STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAAT CT C AAC C AACAATTGAAAATTC AATGAATTCTTC ATCAAATTTGAGT TC AAGTGATT CAGCCGC A 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3104: SAG2148 FROM THE 090 GBS TYPE la STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTC TCAATAGTATCAGTAACGC TGATGT CATCAGTATAGGTGATGTTTTAAAATTGGAT AATTCTAAAG CTAGT C AA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3105: SAG2148 FROM THE A909 GBS TYPE la STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
T CTCTCAATAGT ATC AGTAACGC TGATGT CAT C AGT ATAGGTGATGTTT TAAAATTGGATAATT C TACAGC TAGTCAA 
GCAGAAGCAAAAT C TCAAC C AACAATTGAAAATTCAATGAATT CTTCATCAAATTTGAGT TCAAGTGATT CAGC CGC A 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3106: SAG2148 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

GC ATCTTATACCGTGAAATCAGGTGATACC TTATC AGCTATTGCTAAAAAT C ATAAAACTACGGTAC AAGAGT TAGTG 
T CT CTC AATAGTAT CAGTAACGC TGATGT CATC AGTATAGGTGATGTTTTAAAATTGGAT AATTC TAAAGCT AGT CAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCT CGT CGTGAATCAAATGGTAGTTAT AC TGCACAG AATGGAC AATATTATGGAAGATAT CAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGTTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3107: SAG2148 FROM THE COH1 GBS TYPE III STRAIN 

GCATCTTATAC CGTGAAAT CAGGTGATAC CTTAT CAGC TATTGCTAAAAATC ATAAAACTACGGTAC AAT AGT TAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAAT CTC AAC C AAC AATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGC A 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 
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Table 31: Comparative Sequences relating to SAG2148 

(LysM domain protein) 

SEQ ID NO. 3108: SAG2148 FROM THE H36b GBS TYPE lb STRAIN 
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 



SEQ ID NO. 3109: SAG2148 FROM THE JM9130013 GBS TYPE VIII STRAIN 
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAAGAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGACGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAACTAGTCAA 
GCAGAAGCAAAATCTCAACCAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCA 
AAAGAAGAAATAGCT CGT CGTGAATCAAATGGTAGT TATACTGCACAGAATGGACAATAT TATGGAAGATAT CAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 

SEQ ID NO. 3110: SAG2148 FROM THE M732 GBS TYPE III STRAIN 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAATAGTTAGTG 
TCT CTC AATAGTATC AGTAACGCTGATGT CAT C AGTATAGGTGATGTTTTAAAATTGGAT AATT CTAC AGCTAGTCAA 
G C AGAAGC AAAAT C T C AAC C AAC AATT GAAAAT T C AATGAAT T C T T CAT C AAAT T T G AGT T C AAGTGAT T C AG C TG C A 
AAAGAAGAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGAAGATATCAACTG 
TCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAAGTAGCGGACAATTATGTGGCTTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 



SEQ ID NO. 3111s SAG2148 FROM THE M781 GBS TYPE III STRAIN 
(REVERSE COMPLEMENT) 

GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACTACGGTACAATAGTTAGTG 
TCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGTGATGTTTTAAAATTGGATAATTCTACAGCTAGTCAA 
GCAGAAGCAAAATC TC AAC CAACAATTGAAAATTCAATGAATTCTTCATCAAATTTGAGT TCAAGTGAT TCAGCTGCA 
AAAGAAGAAATAGC TCGT CGTGAATC AAATGGTAGTTATACTGC ACAGAATGGACAATATTATGGAAGATAT CAACTG 
TCT CAATCTT ACCT AAATGGCG ACTT ATCTC C TG AAAAT CAAGAAAAAGTAGCGGAC AATT ATGTGGC TTCTCGTTAC 
GGATCTTGGTCGGCAGCGCTATCATTTTGGAATAGTAACGGCTGGTAT 



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 



GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATC^GGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCA.GGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
G(^TCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 
GCATCTTATACCGTGAAATCAGGTGATACCTTATCAGCTATTGCTAAAAATCATAAAACT 



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 



ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTAC^AGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGT CTC TCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAAGAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGACGTCATCAGTATAGGT 
ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
ACGGTACAATAGTTAGTGTCTCTCAATAGTATCAGTAACGCTGATGTCATCAGTATAGGT 
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Table 31: Comparative Sequences relating to SAG2148 

(LysM domain protein) 



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ310 6 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ310 6 
SEQ3107 
SEQ310 8 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3X04 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 

SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ310 8 
SEQ3109 
SEQ3110 
SEQ3111 



GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCL^G^ 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTGTAAAGCTAGTCAAGGAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCA.GAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTAAAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAACTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

GATGTTTTAAAATTGGATAATTCTACAGCTAGTCAAGCAGAAGCAAAATCTCAACCAACA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTC^ 

ATTGAAAATTCAATGAATTCTTGATCAAATTTGAGTTCAAGTGATTCA 

ATTGAAAATTC^TGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTC^TCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTC^GCCGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCCGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA 

ATTGAAAATTCAATGAATTCTTCATCAAATTTGAGTTCAAGTGATTCAGCTGCAAAAGAA 

GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAG C T CGTCGT GAATCAAATGGTAGT TATACTG CACAGAATG GACAATATTATGGA 
GAAATAGCTCGTCGTGAAT(^U^TGGTAGTTATACTGC^CAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAG CTCGTCGTGAATCAAATGGTAGT TAT AC TGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 
GAAATAGCTCGTCGTGAATCAAATGGTAGTTATACTGCACAGAATGGACAATATTATGGA 

AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATAT CAACTGTCTCAAT CTTACCTAAATGGCGACTTAT CTCCTGAAAATCAAGAAAAA 
AGATATCAACTGTCTCAATCTTACCTAAATGGCGACTTATCTCCTGAAAATCAAGAAAAA 
AGATAT CAACTGTCTCAATCTTACCTAAATGGCGACTTAT CTCCTGAAAATCAAGAAAAA 

GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGTTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAGCGCTATCATTTTGG 
GTAGCGGACAATTATGTGGCTTCTCGTTACGGATCTTGGTCGGCAG CGCTAT CATT TTGG 
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Table 31: Comparative Sequences relating to SAG2148 

(LysM domain protein) 



SEQ3101 
SEQ3102 
SEQ3103 
SEQ3104 
SEQ3105 
SEQ3106 
SEQ3107 
SEQ3108 
SEQ3109 
SEQ3110 
SEQ3111 



AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 
AATAGTAACGGCTGGTAT 



« ► 

>SEQ ID NO 3150:15_1169NT frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQELVSLNS I SNADVI S IGDVLKLDNSTASQAEAKSQPT 
IENSMNSSSNLS SSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 

VADNYVASRYGSWSAALSFWNSNGWY 



>SEQ ID NO 3151:15_18RS21 frame: 1 

AS YTVKSGDTLSAIAKNHKTTVQEIjVSLNS I SNADVI S IGDVLKLDNSTASQAEAKSQPT . 
IENSMNSSSNLS SSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 

VADNYWS RYGS WS AALS FWNSNGWY 



>SEQ ID NO 3152:15_2603 frame: 1 

ASYTVKSGDTLSAIAKNHKTTVQELVSLNSISNADVISIGDVLKLDNSTASQAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 
VADNYWSRYGSWSAALSFWNSNGWY 

>SEQ ID NO 3153:15_090 frame: 1 

AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVI S I GDVL KLDNS KAS QAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 
VADNYWS RYGS WS AALS FWNSNGWY 

>SEQ ID NO 3154:15_A909 frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQELVSLNS I SNADVI S I GDVLKLDNSTAS QAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 
VADNYVASRYGSWSAALSFWNSNGWY 



>SEQ ID NO 3155:15_CJB110 frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQELVSLNS I SNADVI SI GDVLKLDNS KAS QAEAKSQPT 
IENSMNSSSNLS SSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 
VADNYWSRYGSWSAALSFWNSNGWY 

>SEQ ID NO 3156:15_C0H1 frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQ . LVSLNS I SNADVI S I GDVLKLDNSTAS QAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 
VADNYVASRYGSWSAALSFWNSNGWY 

>SEQ ID NO 3157:15JH36B frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQELVSLNS ISNADVIS IGDVLKLDNSTASQAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 
VADNYVASRYGSWSAALSFWNSNGWY 

>SEQ ID NO 3158: 15_JM9130013 frame: 1 

ASYTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVI SIGDVLKLDNSTTSQAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 

VADNYVAS RYGSWS AALS FWNSNGWY 

>SEQ ID NO 3159:15_M732 frame: 1 

AS YTVKSGDTLSAI AKNHKTTVQ . LVSLNS ISNADVIS IGDVLKLDNSTASQAEAKSQPT 
I ENSMNS S SNL S S S DSAAKEE I ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 
VADNYVASRYGSWSAALSFWNSNGWY 

>SEQ ID NO 3160:15JM781 frame: 1 

AS YTVKSGDTL SAI AKNHKTTVQ . LVSLNS I SNADVI SI GDVLKLDNSTAS QAEAKSQPT 
IENSMNSSSNLSSSDSAAKEEIARRESNGS YTAQNGQYYGRYQLSQSYLNGDLSPENQEK 
VADNYVASRYGSWSAALSFWNSNGWY 
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Table 31: Comparative Sequences relating to SAG2148 

(LysM domain protein) 

SEQ3150 AS YTVKSGDTL SAI AKNHKTTVQELVSLNS I SNADVI S I GDVLKLDNS TAS QAEAKSQPT 

SKQ3151 AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3152 AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3 153 AS YTVKSGDTL SAI AKNHKTTVQELVSLNS I SNADVI S I GDVLKLDNS KASQAEAKSQPT 

SEQ3154 AS YTVKSGDTL SAIAKNHKTWQELVSLNS I SNADVI SI GDVLKLDNS TAS QAEAKSQPT 

SEQ3 15 5 AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVI SIGDVLKLDNSKASQAEAKSQPT 

SEQ3156 AS YTVKSGDTLSAIAKNHKTTVQ - LVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3 157 AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3 15 8 AS YTVKSGDTLSAIAKNHKTTVQELVSLNS I SNADVI S IGDVLKLDNSTTSQAEAKSQPT 

SEQ315-9 AS YTVKSGDTLSAIAKNHKTTVQ- LVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3160 AS YTVKSGDTLSAIAKNHKTTVQ - LVSLNS I SNADVIS IGDVLKLDNSTASQAEAKSQPT 

SEQ3 150 IENSMNSSSNLSSSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3 15 1 IENSMNSSSNLSSSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3 15 2 IENSMNSSSNLSSSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3 15 3 IENSMNSSSNLSSSDSAAKEEI ARRESNGSYTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3154 IENSMNSSSNLSSSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3155 IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 

SEQ3156 IENSMNS SSNLS SSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 

SEQ3 157 IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 

SEQ3 15 8 IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQSYLNGDLSPENQEK 

SEQ315 9 IENSMNSSSNLSSSDSAAKEEIARRESNGSYTAQNGQYYGRYQLSQS YLNGDLSPENQEK 

SEQ3 16 0 IENSMNS SSNLS SSDSAAKEEI ARRESNGS YTAQNGQYYGRYQLSQS YLNGDLS PENQEK 

SEQ3150 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 15 1 VADNYWSRYGSWSAALSFWNSNGWY 

SEQ3152 VADNYWSRYGSWSAALSFWNSNGWY 

SEQ3153 VADNYWSRYGSWSAALSFWNSNGWY 

SEQ3154 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 15 5 VADNYWSRYGSWSAALSFWNSNGWY 

SEQ3 15 6 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 157 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 15 8 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 15 9 VADNYVASRYGSWSAALSFWNSNGWY 

SEQ3 160 VADNYVASRYGSWSAALSFWNSNGWY 
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Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF00003 


SAG001 7 


M At ™ W 

447 


PcsB protein 


ORF00004 


SAGO0 18 


322 


ribose-phosphate pyrophosphokinase 


ORF00005 


SAGO0 19 


391 


aminotransferase, class I 


ORF00006 


SAG0020 


253 


recombination protein O 


ORF00008 


SAG0021 


283 


protease, putative 


ORF00009 


SAG0022 


330 


fatty acid/phospholipid synthesis protein PIsX j 


ORF00010 


SAG0023 


79 


acyl carrier protein 


ORF00011 


SAG0024 


234 


phosphoribosylaminoimidazole-succinocarboxamide 
synthase 


ORF00012 


SAG0025 


1241 


phosphoribosylformylglycinamidine synthase, putative 


ORF00013 


SAG0026 


484 


amidophosphoribosyltransferase 


ORF00014 


SAG0027 


340 


phosphoribosylformylglycinamidine cyclo-ligase 


ORF00015 


SAG0028 


182 


phosphoribosylglycinamideformyltransferase 


ORF00016 


SAG0029 


250 


acetyltransferase, GNAT family 


ORF00017 


SAG0030 


515 


phosphoribosylaminoimidazolecarboxamide 
formyltransferase/IMP cyciohydrolase 


ORF00018 


SAG0031 


283 


peptidase, M23/M37 family 


ORF00020 


SAG0032 


434 


group B streptococcal surface immunogenic protein 


ORF00021 


SAG0G33 


232 


N-acetylmannosamine-6-P epimerase, putative 


ORF00022 


SAG0034 


438 


sugar ABC transporter, sugar-binding protein 


ORF00023 


SAG0035 


295 


sugar ABC transporter, permease protein 


ORF00024 


SAG0036 


276 


sugar ABC transporter, permease protein 


ORF00025 


SAG0037 


147 


conserved hypothetical protein 


ORF00026 


SAG0038 


220 


conserved hypothetical protein 


ORF00027 


SAG0039 


305 


N-acetylneuraminate lyase, putative 


ORF00028 


SAG0040 


293 


ROK family protein 


ORF00029 


SAG0041 


325 


acetyl xylan esterase, putative 


ORF00030 


SAG0042 


267 


phosphosugar-binding transcriptional regulator, RpiR 
family, putative 


ORF00031 


SAG0043 


421 


phosphoribosylamine-glycine ligase 


ORF00032 


SAG0044 


162 


phosphoribosylaminoimidazole carboxylase, catalytic 
subunit j 


ORF00033 


SAG0045 


363 


phosphoribosylaminoimidazole carboxylase, ATPase 
subunit 


ORF00035 


SAG0046 


463 


hypothetical protein 


ORF00036 


SAG0047 


432 


adenylosuccinate lyase 


ORF00037 


SAG0048 


i~\ t~\ 4*1 

303 


transcriptional regulator, Cro/CI family 


ORF00038 


SAG0049 


332 


Holhday junction DNA helicase RuvB 


ORF00039 


SAG0050 


MAI — 

145 


phosphotyrosine protein phosphatase, low molecular 
weignt 


r~r\r\r\ A r\ 


oAbUUO 1 




mUKN moiiT Tamiiy prorein 


ORF00041 




COO 


memorane protein, putative 


ORF00042 


SAG0053 


oon 
ooO 


aldehyde-alcohol dehydrogenase 


ORF00043 


SAG0054 


338 


alcohol dehydrogenase, propanol-preferring 


ORF00044 


SAG0055 


496 


threonine synthase 




OAoUUOD 




iviM i c eiTiux Tamiiy proiem 


ORF00046 


SAG0057 


102 


ribosomal protein S10 


ORF00047 


SAG0058 


208 


ribosomal protein L3 


ORF00048 


SAG0059 


207 


ribosomal protein L4 


ORF00049 


SAG0060 


98 


ribosomal protein L23 


ORF00050 


SAG0061 


277 


ribosomal protein L2 


ORF00052 


SAG0062 


92 


ribosomal protein S19 


ORF00054 


SAG0063 


114 


ribosomal protein L22 


ORF00055 


SAG0064 


217 


ribosomal protein S3 
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ORF Ref No. 


SAGxxxx Ket No. 


aa 


Annotation 


ORF00056 




137 


nbosomai protein Lib 


vJKrUUUOo 


oAbUUbD 


DO 


nDosomai protein uzy 


I OKrOOOoy 


0A0UU0 / 


OD 


nDosomai protein oi # 


URFOOOoO 


oAoUUbo 


122 


nDosomai protein i_i4 


ORF00061 


SAoUUby 


101 


nDosomai protein Lz4 


ORF00063 


SAG0070 


180 


nbosomai protein Lo 


ORF00064 


SAG0071 


61 


nbosomai protein S14, putative 


ORF00065 


SAG0072 


132 


ribosomal protein S8 


ORF00066 


SAG 0073 


178 


ribosomal protein L6 


ORF00068 


SAG0074 


A A O 

118 


ribosomal protein L18 


ORF00069 


SAG0075 


164 


ribosomal protein S5 


ORF00070 


SAG0076 


59 


ribosomal protein L30 


ORF00071 


SAG0077 


146 


ribosomal protein L15 


ORF00072 


SAG0078 


434 


preprotein translocase, SecY subunit 


ORF00073 


SAG0079 


212 


adenylate kinase 


ORF00074 


SAG0080 


72 


translation initiation factor IF-1 | 


ORF00075 


SAG0081 


38 


ribosomal protein L36 


ORF00077 


SAG0082 


A i~\A 

121 


ribosomal protein S13 


ORF00078 


SAG0083 


118 


ribosomal protein S1 1 


ORF00080 


SAG0084 


OHO 

312 


UNA-directecl kna polymerase, aipna suounit 


ORF00081 


SAG0085 


128 


ribosomal protein L1 7 


ORF00087 


SAG0086 


97 


hypothetical protein 


ORF00088 


SAG0087 


59 


hypothetical protein 


ORF00089 


SAG0088 


56 


hypothetical protein 


ORF00090 


SAG0089 


183 


conserved hypothetical protein 


ORF00091 


SAG0090 


139 


conserved hypothetical protein 


ORF00093 


SAG0091 


144 


transcriptional regulator ComX1, putative 


ORF00094 


SAG0092 


230 


phosphoglycerate mutase family protein 


ORF00095 


SAG0093 


250 


D-alanyl-D-alanine carboxypeptidase family protein 


ORF00096 


SAG0094 


191 


N-acetylmuramoyl-L-alanine amidase, family 4 protein 


ORF00097 


oAoUU9o 


O A A 


neat-mauciDie transcription repressor nrc/A 


AOCAAAAO 

ORF0009o 


0 a /*^nnri£5 
oAC3009o 


19U 


neat snocK protein orpt: 


ORFU0099 


0 AoUUy / 


ana 

buy 


anar\ protein 


ORF00100 


SAG0098 


379 


dnaJ protein 


ORF00101 


SAG0099 


A A !~ 

415 


transcriptional regulator, GntR family 


ORF00102 


SAG0 100 


258 


tRNA pseudouridine synthase A 


ORF00103 


SAG0101 


252 


phosphomethylpynmidine Kinase, putative 


ORF00104 


SAG0 102 


154 


conserved hypothetical protein 


ORF00105 


SAG0 103 


189 


conserved hypothetical protein 


ORF00106 


SAG0 104 


280 


conserved hypothetical protein 


ORF00107 


SAG0 105 


427 


trigger factor 


ORF00108 


SAG0106 


191 


DNA-directed RNA polymerase, delta subunit, putative 


ORF00109 


SAG0 107 


534 


CTP synthase 


ORF00110 


SAG0 108 


308 


conserved hypothetical protein 


ORF001 1 1 


SAG0 109 


148 


deoxyuridine 5 x -triphosphate nucleotidohydrolase | 


ORF00112 


SAG0110 


454 


DNA repair protein RadA 


ORF00113 


SAG0111 


165 


carbonic anhydrase-related protein 


ORF00115 


SAG0112 


439 


pyridine nucleotide-disulphide oxidoreductase family 
protein 


ORF00116 


SAG0113 


484 


glutamyl-tRNA synthetase 


ORF00117 


SAG0114 


322 


ribose ABC transporter, periplasmic D-ribose-binding 
protein 
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»xi rxei no. 


OAoXXXX KeT INO. 


aa 


Annotation 


UrxrUU I I o 


OAoUl 1 0 


OHO 
OlU 


noose ABC transporter, permease protein 


vJixiUU 1 i y 


OAoUl 1 D 


AGO 


nbose ABC transporter, ATP-binding protein 


ORFnni9n 


Q A flCW -\ "7 
OAoU 1 1 / 


H OO 


noose ABC transporter protein RbsD 


ursruu \£.\ 


oAv^U MO 




riDokinase 






000 


ribose operon repressor RbsR 


UKrUU 12o 


O A f~^f\4 on 


32 


hypothetical protein 




SAG0121 


362 


permease, putative 


UKrUU120 


SAG0 122 


ooo 
228 


ABC transporter, ATP-binding protein 


t-JKrUU12b 


SAG 01 23 


AAA 

223 


DNA-binding response regulator 


nocnn-f oo 


SAG0 124 


ATA 

356 


sensor histidme kinase 


nDtrnn-i on 


0 a on on 
SAG0 125 


396 


argininosuccinate synthase 


UKrUUI oU 


SAG0 1 2d 


>l £>0 

462 


argininosuccinate lyase 


UKrUUIol 


SAG0 127 


ooo 

293 


fructose-bisphosphate aldolase 


ORF00132 


SAG0 128 


305 


L-2-hydroxyisocaproate dehydrogenase 


ORF00133 


SAG0 129 


62 


ribosomal protein L28 


ORF00134 


SAG0130 


121 


conserved hypothetical protein 


I ORF00135 


SAG0131 


543 


DAK2 domain protein 


ORF00136 


SAG0 132 


294 


SPFH domain/Band 7 family protein 


ORF00137 


SAG0 133 


38 


conserved hypothetical protein 


ORF00138 


SAG0 134 


96 


hypothetical protein 


ORF00141 


SAG0 135 


246 


ammo acid ABC transporter, ATP-binding protein 


ORF00142 


SAG0 136 


516 


amino acid ABC transporter, amino acid-binding 
protein/permease protein 


ur\ruu i*to 


unOU lo/ 


DZ / 


conserved nypotnetical protein 




onoU loo 


£1 y 


unaecaprenoi Kinase, putative 




0 a r^tni QQ 
OnbU 1 o» 




negative regulator ot competence MecA, putative 


ORF00148 


SAG0140 


386 


glycosyl transferase, group 4 family protein 


ORF00149 


SAG0 141 


256 


ABC transporter, ATP-binding protein 


ORFQ0150 


SAG0 142 


420 


conserved hypothetical protein 


ORF00151 


SAG0 143 


410 


selenocysteine lyase 


ORF00152 


SAG0144 


147 


NifU family protein 


ORF00153 


SAG0145 


472 


conserved hypothetical protein 


ORF00154 


SAG0146 


395 


penicillin-binding protein 4, putative 


ORF00155 


SAG0147 


411 


D-alanyl-D-alanine carboxypeptidase 


ORF00156 


SAG0148 


551 


oligopeptide ABC transporter, substrate binding protein, 
putative 


ORF00157 


SAG0 149 


304 


oligopeptide ABC transporter, permease protein 


ORF00158 


SAG0150 


343 


oligopeptide ABC transporter, permease protein 


ORF001 60 


SAG0151 


348 


oligopeptide ABC transporter, ATP-binding protein 


ORF00161 


SAG0152 


310 


oligopeptide ABC transporter, ATP-binding protein 


iJKrUUI DO 


OAPrt'l CO 

oALpUIoo 


283 


4-diphosphocytidyl-2C-methyl-D-erythntol kinase 




oa rrn c /i 
oAbUI o4 


A A~T 

147 


adc operon repressor AdcR 


(JKruUlbo 


O A f~*f\A IT C 

SAG0 155 


ri A 

236 


zinc ABC transporter, ATP-binding protein 


ORF00169 


SAG01 56 


270 


zinc ABC transporter, permease protein 


ORF00172 


Or\vJU 1 »J(J 




lyrosyi-irxiNM syninetase 


ORF00173 


SAG0159 


765 


penicillin-binding protein 1B, putative 


ORF00174 


SAG0160 


1191 


DNA-directed RNA polymerase, beta subunit 


ORF00176 


SAG0161 


1216 


DNA-directed RNA polymerase beta' subunit 


ORF00178 


SAG0162 


121 


conserved hypothetical protein I 


ORF00179 


SAG0163 


323 


competence protein CgIA 


ORF00180 


SAG0164 


282 


competence protein CgIB 


ORF00181 


SAG0165 


151 


conserved hypothetical protein 


ORF00182 


SAG0166 


123 


conserved domain protein 



665 



WO 2004/018646 



PCT/US2003/026827 



Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



UKr KeT NO. 


oAuXXXX KeT IMO. 


do 


rtn n o la II o n 


UKrUU loo 


OnoU 1 O / 


^94. 


uuiiotu vtJU i iy p 1 -' l ' iwiuoi pi uicii i 


UKrUUI o*f 


QAr^ni rr 

OnUU 1 OO 


OS f 


dOc Id LtJ Ml IClOC 


UKrUUlOD 


QAftiHi RQ 
OnoU 1 Do 




U dl loul lUHOl Idi icyuiaiui , wiu/ui i all Illy 


UKrUU I O ( 


QAf^m 7n 

OMOU 1 / U 


4*5 


h\/nnthp»tipal nrotpin 


UKrUU i oo 






hvnnthotir'ol nrotpin 
iiy [juii i tJiiucii fji w id 1 1 


UKrUU 1 oy 




991 


piUlCaoc, |JU lea Live 


UKrUUI yu 


QAf^ni 7^ 


9^ 


Ijy 1 1 will 1 c J vCi I uus\y tat,*? i cuu^ lcio 


nDCnn<i q*i 

UKruu iyi 


Qflf^ni 74 




y luioi 1 1 y lai 1 111 iL/^JC^JuuaotJ 


UKr uun y*d 


c A/^ni 7R 


7Q 


1 lyjJUll It? lineal ^/Iv^Lolll 


notnn-i no 
UKrUU! yo 


QAf^fM 7R 




rnn?pr\/pH hv/nothptip^l nrntpin 

LUI loCI VC3U 1 iy JJVJ U IC Ulial |JI ULCII 1 


(JKrUUiy4 


QACm 77 


I U 1 


LI HOI tJUUAU 1 Idlliliy (JIOICIM 


UKrUUiyt) 


Q A f~Z.C\ 1 7fl 
OMoU 1 / O 




tPMA hinrlinn Hnmain nrntpin 

i lx IN /A UlllUlliy UOIIIdlll [JiOLCIII 


ORF00196 


SAG0 179 


238 


conserved hypothetical protein 


ORF00198 


SAG0 180 


131 


single-strand uinoing protein 


ORF00199 


SAG0 181 


Z14 


nyaroiase, naioacia aenaiogenase-iiKe Tamiiy 


ORF00200 


SAG0 182 


581 


sensor nistidine Kinase, putative 


ORF00201 


«*> A /~sf\A OO 

SAG0 183 


O A & 

246 


response regulator 


ORF00203 


SAG0 184 


A CA 

151 


conserved nypotnetical protein 


j ORF00204 


SAG0 185 


242 


membrane protein, putative 


ORF00205 


SAG0 186 


3d 


nypotnetical protein 


ORF00206 


SAG0 187 


542 


oligopeptide Adu transporter, oiigopeptioe-uinaing 
proiein 


cnnon7 
UKrUUzU/ 


OnoU 1 OO 






nDcnnono 
UKrUUZUo 


oAoU i oy 


97*^ 


Uliy OpfcipilUt; nDU LI dl lopUl Id , 1 1 ICaoC pi Old! 1 


UKrUUZUy 


Q AfSOl GO 

oMou i yu 


9R7 


nontirlo ARP trjanQnrirtpr ATP-hinriinn nrntpin 

[JtJUllUtJ AADv LldiiopOILCI, AA 1 r UlllUlliy piUlCII 1 


^Dcnno^ n 
UKrUU^j. I U 


Q Af^fM Q1 
OnoU i y 1 


90ft 


n o n t i H o ARr^ tronennrfpr ATP-hinHinn nrntpin 
p"pLloc r\Lj\^ Ll dl lo^JUi lci , r\ i r uii luu iy piutcui 


UKrUU^ 1 1 


QA<^01 Q9 


R7R 


PT^5 cv/ctpm llARf^ rnmnnnpnfQ 
r i o oy o lci i i, i inuu 1 1 1 pui ici no 


UKrUUZ 


onou i yo 


^A1 


alnha amulacp familv nrntpin 
dtUI Id dlllyldoC Idlliliy jjiolciii 


UKrUUZ 14 


OMoU I y*+ 


ooy 


trariQr*rintinnal antitprminatnr Rrtlfi ffliimilv 
11 dl loOl IOLIOI Idl dl IllLci 1 1 ill JdLOl , uy I v_? idlliliy 


UKrUUZ J O 


oMou i yo 


^77 


1Q1R4A trancnnQflQP 
IO UHO, UdllojJUodoC 


UKrUUZl / 


omou i y o 


RR 
oo 


rnncpivprl rlnmain nrntpin 

UUIIOCI VCU UOIIIdlll [JIUICII 1 


UKrUUZlo 


OnuU 1 y / 




P*TQ evetpm IIR rnmnnnpnt ntitativp 
i 1 O oyoltJl l if II D oon ipoi id it, jjuiaii vc 


UKrUUZiy 


oMou i y o 


to 1 


PT^ evetpm llt^ rnmnnnpnt nutatix/p 

• lOoyoLdll, llw uuillpuiicnii puiauvc 


UKrUUZZU 


1 QAfSOIQQ 

omou i yy 


zoo 


tranckiatnlaQp M-tPrminal <?i ihi init 

LI dl lOINdOldOC, 1 M Id 1 1 HI 101 OUUUI III 


UKrUU(C£ 1 


OnOUtUU 


OUJ3 


tran<:kptnifl«iP r^-tprminal ^iiihiinit 

II dl lorxClUlaoC , u ici 1 1 ill idi ouuuiin 


UKrUUtitO 


OMOU^LU I 


41Q 


nyiHnrprii ifta^p niitfltivp 


f^t?pnn994 


< SAf^0909 

OMO UZ.Ui_ 


8Q 


rihnQnmal nrntpin S1R 


UKruU^ZO 


QAf5090^ 

OMOU/.UO 


70 Q 


nnK/rihnni mlpntiHp niinlpntiHvltransfprase 


UKrUUZZO 


OMOU*fcU** 


9^0 


rnnQPrvpH hvnnthptir*al nrntpin 

vUlioci vcu i iy jjv-fii iciivvcii piuivii i 


nppnn997 




1Q4 


Qprinp O-piPPtvltr^n^fprafifi 

Od II Iv7 w aOCLViti ai ioici uow 


UKrUUZZO 


OnOUZUD 


RO 


h\/nnthptir*al nrntpin 
i lypoiiidiodi piuicni 


UKrUU/zy 


QArin9n7 

OAOUZU / 


447 


r*\/ctoin\/l-.tRM A <5V/nthpt3^P 
oyoLcii iy i Lr\i Hr\ oyi in iciaoc 


UKruuzou 


OMoUZUO 


1 9P 


pnncan/oH h\/nnth^tir*al nrntpin 
LtUiiociVcu i lyuoii iciiodi uioidM 


ORF00231 


SAG0209 


251 


RNA methyltransferase, TrmH family, group 3 


ORF00232 


SAG0210 


172 


conserved hypothetical protein 


ORF00233 


SAG021 1 


286 


DegV family protein 


URrOU234 


OMoUZIZ 


0.9 


nypoinsiiGcii pruitJiii 


ORF00235 


SAG0213 


39 


hypothetical protein 


ORF00236 


SAG0214 


148 


ribosomal protein L1 3 


ORF00237 


SAG0215 


130 


ribosomal protein S9 


ORF00238 


SAG021 6 


33 


hypothetical protein | 


ORF00239 


SAG0217 


384 


site-specific recombinase, phage integrase family 


ORF00240 


SAG0218 


158 


transcriptional regulator, Cro/CI family 


ORF00241 


SAG0219 


101 


hypothetical protein 
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ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF00242 


SAG0220 


92 


conserved hypothetical protein 


ORF00243 


SAG0221 


76 


hypothetical protein 


ORF00244 


SAG0222 


108 


conserved domain protein 


ORF00245 


SAG0223 


209 


conserved hypothetical protein, fusion 


ORF00246 


SAG0224 


332 


replication initiation protein, putative 


ORF00247 


SAG0225 


144 


hypothetical protein 


ORF00248 


SAG0226 


418 


recombination protein 


ORF00249 


SAG0227 


156 


hypothetical protein 


ORF00250 


SAG0228 


111 


conserved hypothetical protein 


ORF00251 


SAG0229 


95 


conserved hypothetical protein 


ORF00252 


SAG0230 


96 


conserved hypothetical protein 


ORF00253 


SAG0231 


135 


hypothetical protein 


ORF00254 


SAG0232 


186 


hypothetical protein 


ORF00255 


SAG0233 


226 


hypothetical protein 


ORF00256 


SAG0234 


128 


hypothetical protein 


ORF00257 


SAG0235 


93 


hypothetical protein 


ORF00258 


SAG0236 


32 


hypothetical protein 


ORF00259 


SAG0237 


34 


hypothetical protein 


ORF00260 


SAG0238 


41 


hypothetical protein 


ORF00261 


SAG0239 


286 


transcriptional regulator MutR family 


ORF00262 


SAG0240 


393 


transporter, putative 


ORF00263 


SAG0241 


213 


amino acid ABC transporter, permease protein 


ORF00264 


SAG0242 


308 


amino acid ABC transporter, amino acid-binding 
protein 


ORF00265 


SAG0243 


211 


ammo acid ABC transporter, permease protein 


ORF00266 


SAG0244 


381 


amino acid ABC transporter, ATP-binding protein 


ORF00272 


SAG0245 


152 


hypothetical protein 


■^■X 1"™ /"V jr\^ ~» 

ORF00273 


SAG0246 


268 


hypothetical protein 


ORF00274 


SAG0247 


116 


hypothetical protein 


ORF00275 


SAG0248 


90 


hypothetical protein 


ORF00276 


SAG0249 


116 


hypothetical protein 


ORF00278 


SAG0250 


193 


hypothetical protein 


ORF00279 


SAG0251 


72 


transcriptional regulator, Cro/CI family 


ORF00280 


SAG0252 


186 


acetyltransferase, GNAT family 


ORF00281 


SAG0253 


192 


acetyltransferase, GNAT family 


ORF00282 


SAG0254 


226 


acetyltransferase, GNAT family 


ORF00283 


SAG0255 


315 


conserved hypothetical protein 


ORF00284 


SAG0256 


163 


RNA polymerase sigma factor, ECF subfamily 


ORF00285 


SAG0257 


53 


hypothetical protein 


ORF00287 


SAG0258 


202 


transcriptional regulator, TetR family 


ORF00288 


SAG0259 


365 


ABC transporter efflux protein, DrrB family, putative 


ORF00289 


SAG0260 


238 


ABC transporter, ATP-binding protein 


ORF00290 


SAG0261 


129 


IS1381, transposase OrfB 


ORF00291 


SAG0262 


127 


IS1381, transposase OrfA 


ORF00292 


SAG0263 


171 


hypothetical protein 


ORF00293 


SAG0264 


103 


conserved hypothetical protein 


ORF00294 


SAG0265 


235 


conserved hypothetical protein 


ORF00295 


SAG0266 


382 


N-acetyIglucosamine-6-phosphate deacetylase 


ORF00296 


SAG0267 


180 


conserved hypothetical protein 


ORF00297 


SAG0268 


304 


glycyl-tRNA synthetase, alpha subunit 


ORF00298 


SAG0269 


213 


acyl carrier protein phosphodiesterase, putative 


ORF00299 


SAG0270 


679 


glycyl-tRNA synthetase, beta subunit 


ORF00300 


SAG0271 


85 


conserved hypothetical protein 


ORF00301 


SAG0272 


87 


membrane protein, putative 
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ADC p 0 f Kt n 


j SAGXXXX Ket NO. 


aa 


Annotation 


nRFnn^n9 


bAoU<£/o 


cat 


glycerol kinase 


ur\rUUOUO 


O A ^097/1 


ouy 


alpha-glyceropnospnate oxidase 


L-Txr UUOU4 






glycerol uptake tacihtator protein 


Ur\rUUoUs) 




A AC 

440 


nadh oxidase, putative 


Ur\rUUOUO 


O A PA077 


4/b 


conserved hypothetical protein 




oAoOz / o 


661 


transketolase 


vJKrUUoUo 


SAG0279 


101 


conserved hypothetical protein 


ORF00309 


SAG0280 


244 


ABC transporter, ATP-binding protein 


ORF00310 


SAG0281 


534 


membrane protein, putative 


ORF00313 


SAG0282 


461 


PTS system, IIBC components 


ORF00314 


SAG0283 


267 


glutamate 5-kinase 


ORF00315 


SAG0284 


417 


gamma-glutamyl phosphate reductase 


ORF00316 


SAG0285 


298 


conserved hypothetical protein TIGR00006 


ORF00317 


SAG0286 


108 


cell division protein FtsL, putative 


ORF00318 


SAG0287 


752 


penicillin-binding protein 2X 


ORF00319 


SAG0288 


336 


phospho-N-acetylmuramoyl-pentapeptide-transferase 


ORF00320 


SAG0289 


447 


ATP-dependent RNA helicase, DEAD/DEAH box 
family 


UKrUUoZ i 


oAijUzyU 


270 


ABC transporter, substrate-binding protein 








amino acid ABC transporter, permease protein 


nDPnnio<i 

UlxrUUO^O 


CArtH9Q9 

oAbu^yz 




amino acid abu transporter, ATP-binding protein 




oAouzyo 


~7A 
f 4 


conserved nypotnetical protein 




OAbUZy4 


OU4 


tnioreaoxin reductase 


ORPfin^9fi 
Ur\rUUOitD 




Ana 

400 


conserved nypotnetical protein 


OR FCSH^OV 


oAou^yo 


£.1 0 


inau syntnetase 


nppnn^ofi 

Uf\rUUO<io 




AAA 

444 


ammopeptioase u 




oAou^yo 


f OU 


penicuiin-Dinding protein ia 




CAr*n9oo 

oAv^u^yy 


•4 no 

Tyy 


recomoi nation protein u 


ORF00331 


SAG0300 


172 


conserved hypothetical protein | 


- ORF00332 


SAG0301 


40 


hypothetical protein 


ORF00333 


SAG0302 


110 


conserved hypothetical protein 


ORF00335 


SAG0303 


384 


conserved hypothetical protein 


ORF00336 


SAG0304 


487 


conserved hypothetical protein 


ORF00337 


SAG0305 


160 


autoinducer-2 production protein LuxS 


ORF00338 


SAG0306 


535 


KH domain protein 


ORF00340 


SAG0307 


33 


hypothetical protein 


ORF00341 


SAG0308 




ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF00343 


SAG0309 


246 


ABC transporter, permease protein, putative 


ORF00344 


SAG0310 


361 


conserved hypothetical protein 


ORF00345 


SAG031 1 




DNA-binding response regulator POINT MUTATION 


ORF00347 


SAG0312 


234 


conserved hypothetical protein 


ORF00348 


SAG0313 


209 


guanylate kinase 


ORF00349 


SAG0314 


104 


DNA-directed RNA polymerase, omega subunit, 
putative 


ORF00350 


SAG0315 


796 


primosomal protein N* 


ORF00351 


SAG0316 


311 


methionyl-tRNA formyltransferase 


ORF00352 


SAG0317 


440 


Sun protein 


ORF00353 


SAG0318 


245 


serine/threonine phosphatase, putative 


ORF00354 


SAG0319 


651 


serine/threonine protein kinase 


ORF00355 


SAG0320 


231 


conserved hypothetical protein 


ORF00356 


SAG0321 


339 


sensor histidine kinase, putative 


ORF00358 


SAG0322 


213 


DNA-binding response regulator 
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SAGxxxx Ref No. 


aa 


Annotation 


ORF00359 


SAG0323 


466 


hydrolase, haloacid dehalogenase family/peptidyl- 
prolyl cis-trans isomerase, cyclopnilin type 


UrvrUllODU 




1 -c4 


general stress protein, putative 


UKrUUoD 1 






pyruvate Tormate-iyase-activating enzyme 


! ORPriOARQ 
UKrUUOD^ 


OAoUOZO 


/CO I 


transcnptionai regulator, DeoR family 


URrUUOOJ 




097 


transcnptionai regulator, putative 


ur\ruuoo4 


OAoUoZO 




k i o system, ceiioDiose-specific IIA component 


UKrUUoOO 


oAoUoZy 


-1 AC 

1 Ub 


k 1 b system, cellobiose-specific IIB component 


UKrUUoo/ 


oACaUooU 




r I o system, cellobiose-specific HO component 


UKrUUoDo 


oAoUool 




tormate acetyitransterase 


vJKrUUoby 


oAoUool 




transaidolase family protein 


vJKrUUo/ 1 


oAoUooo 




glycerol dehydrogenase 


nDcnno7o 
UKrUUo/Z 


oAbUoo4 


oUo 


cysteine synthase A 




oAoUoOO 


^14 


conserved hypothetical protein TIGR00257 


LJKrUUo / 4 


oAoUooD 


4zy 


helicase, putative 


L/KrUUo / O 


OAP/TJQ7 

oAoUoo / 




competence protein F, putative 


LJ'KrUUo /o 


oAoUooo 


1o4 


ribosomal subunit interface protein 




CAPfllQQ 

oAbUooa 


4oU 


aspartate kinase family protein 




oAoUo4U 




hydrolase, haloacid dehalogenase-hke family 


UKrUUoo4 


oAbuo4i 


4y 


hypothetical protein 


ORF00385 


SAG0342 


263 


enoyl-CoA hydratase/isomerase family protein 


ORF00386 


SAG0343 


A A M 

144 


transcnptionai regulator, MarR family 


ORF00387 


SAG0344 


323 


3-oxoacyl-(acyl-carrier-protein) synthase III 


ORF00388 


SAG0345 


74 


acyl carrier protein 


ORF00390 


SAG0346 


319 


enoyl-(acyl-carrier-protein) reductase II 


ORF00391 


SAG0347 


308 


malonyl CoA-acyl carrier protein transacylase 


ORF00392 


SAG0348 


244 


3-oxoacyl-[acyl-carrier protein] reductase 


ORF00393 


SAG0349 


410 


3-oxoacyl-(acyl-carrier-protein) synthase II 


ORF00394 


SAG0350 


166 


acetyl-CoA carboxylase, biotin carboxyl carrier protein 


ORF00395 


SAG0351 


140 


(3R)-hydroxymyristoyl-(acyl-carrier-protein) 
dehydratase 


ORF00396 


SAG0352 


456 


acetyl-CoA carboxylase, biotin carboxylase 


ORF00397 


. SAG0353 


291 


acetyl-CoA carboxylase, carboxyl transferase, beta 
subunit 


UKrUUo9o 


SAG0354 


OCT 

257 


acetyl-CoA carboxylase, carboxyl transferase, alpha 
suounit 




OnOUOuj 


oi n, 
1 u 


conservea nypometicai protein 


urvruutuu 


o Af^n^^R 

OrtOUO JD 




seryi-iKiMA syntnetase 








nypoineiicai protein 


ORF00403 


SAG0358 


120 


conserved hypothetical protein 


ORF00404 


SAG0359 


303 


PTS system, mannose-specific 110 component 


ORFQU405 


SAG0360 


270 


PTS system, mannose-specific IIO component 


QRF00406 


SAG0361 


336 


PTS system, mannose-specific IIAB components 


ORF00407 


SAG0362 i 


270 


hydrolase, haloacid dehalogenase-like family 


ORF00408 


SAG0363 


194 


hypothetical protein 


ORF00409 


SAG0364 


203 


membrane protein, putative 


VJKrUU41 U 


O A Pnooc 
oACjUoOO 


473 


xanthine/uracil permease family protein 


ORF0041 1 


SAG0366 


169 


conserved hypothetical protein TIGR00150, putative 


ORF00412 


SAG0367 


186 


acetyltransf erase, GNAT family 


ORF00413 


SAG0368 


435 


transcriptional regulator, putative 


ORF00414 


SAG0369 


98 


conserved hypothetical protein i 


ORF00415 


SAG0370 


139 


HIT family protein 


ORF00416 


SAG0371 


167 


hypothetical protein 


ORF00417 


SAG0372 


85 | hypothetical protein j 
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ORF00419 


SAG0373 


241 


ABC transporter, ATP-binding protein 


ORF00421 


SAG0374 


344 


ABC transporter, permease protein 


ORF00422 


SAG0375 


266 


conserved hypothetical protein 


ORF00423 


SAG0376 


211 


conserved hypothetical protein T1GR00091 


ORF00424 


SAG0377 


127 


conserved hypothetical protein, POINT MUTATION 


ORF00425 


SAG0378 


379 


N utilization substance protein A 


ORF00426 


A jf^\ f\ J"\ —WA~\ 

SAG0379 


98 


conserved hypothetical protein 


ORF00427 


SAG0380 


100 


nbosomal protein L7A family 


ORF00428 


SAG0381 


927 


translation initiation factor IF-2 


ORF00429 


SAG0382 


122 


nbosome-bindmg factor A 


ORF00430 


SAG0383 


334 


conserved hypothetical protein 


ORF00431 


SAG0384 


138 


transcriptional repressor CopY 


ORF00432 


SAG0385 


744 


copper-transporter ATPase CopA 


• ORF00433 


SAG0386 


68 


copper-transporter protein CopZ 


ORF00434 


SAG0387 


204 


conserved hypothetical protein 


ORF00435 


SAG0388 


270 


hydrolase, haloacid dehalogenase-like family 


ORF00436 


SAG0389 


880 


DNA polymerase I 


ORF00437 


SAG0390 


146 


Co A binding domain protein 


ORF00438 


SAG0391 


159 


transcriptional regulator, Fur family 


ORF00439 


SAG0392 


521 


cell wall surface anchor family protein 


ORF00440 


SAG0393 


228 


DNA-binding response regulator 


^vnr - t\r\ a a a 

ORF00441 


SAG0394 


345 


sensor nistidine Kinase 


ORF00442 


SAG0395 


246 


conserved hypothetical protein 


ORF00443 


SAG0396 


380 


* a >' ' v AIA *■ ■ 4) #■ 

queuine tRNA-nbosyltransferase 


ORF00444 


SAG0397 


102 


conserved hypothetical protein 


ORF00445 


SAG0398 


179 


bioY family protein 


ORF00446 


SAG0399 


258 


AtsA/ElaC family protein 


ORF00447 


SAG0400 


168 


cytidine/deoxycytidylate deaminase family protein 


ORF00448 


SAG0401 


44 


hypothetical protein 


ORF00449 


SAG0402 


449 


glucose-6-phosphate isomerase 


ORF00450 


SAG0403 


175 


5-formyltetrahydrofolate cyclo-ligase family protein 


ORF00451 


SAG0404 


225 


rhomboid family protein 


ORF00452 


SAG0405 


347 


lipoprotein 


ORF00453 


SAG0406 


299 


UTP-glucose-1 -phosphate uridyiyltransierase 


ORF00454 


SAG 0407 


338 


glyceroi-3-pnospnate aenyarogenase (nau(P;+) 


ORF00455 


SAG0408 


109 


ribonuclease P protein component 


ORF00456 


SAG0409 


271 


SpolllJ family protein 


ORF00458 


SAG0410 


273 


R3H domain protein 


ORF00463 


SAG041 1 


177 


conserved hypothetical protein 


ORF00464 


SAG0412 


258 


RecX protein 


ORF00465 


SAG0413 


451 


RNA methyltransferase, TrmA family 


ORF00466 


SAG0414 


153 


conserved hypothetical protein 


ORF00467 


SAG0415 


142 


acetyltransferase, GNAT family 


ORF00468 


SAG0416 


1233 


protease, putative 


ORF00469 


SAG0417 


302 


glycosyl transferase, group 2 family protein 


ORF00470 


SAG0418 


336 


ribonucleoside-diphosphate reductase 2, beta subunit 


ORF00471 


SAG0419 


137 


nrdl protein 


ORF00472 


SAG0420 


721 


ribonucleoside-diphosphate reductase 2, alpha subunit 


ORF00473 


SAG0421 


1055 


conserved hypothetical protein 


ORF00474 


SAG0422 


129 


conserved hypothetical protein x 


ORF00475 


SAG0423 


132 


conserved domain protein 
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ORF00476 


SAG0424 


94 


hypothetical protein 


ORF00478 


SAG0425 


105 


carboxymuconolactone decarboxylase family protein 


ORF00479 


SAG0426 


131 


1 1 XT f " | . « 

conserved hypothetical protein 


ORF00480 


SAG0427 


129 


transcriptional regulator, MerR family 


1 ORF00482 


SAG0428 


345 


alcohol dehydrogenase, zinc-containing 


ORF00483 


SAG0429 


284 


oxidoreductase, aldo/keto reductase family 


ORF00484 


SAG0430 


287 


cation efflux system protein 


ORF00485 


SAG0431 


174 


transcriptional regulator, TetR family 


ORF00486 


SAG0432 


397 


transcriptional regulator, AraC family 


ORF00487 


SAG0433 


1389 


surface protein Rib 


ORF00488 


SAG0434 


61 


transposase, IS256 family, truncation 


ORF00489 


SAG0435 


97 


DNA-damage-inducible protein J, putative 


ORF00490 


SAG0436 


62 


hypothetical protein 


ORF00491 


SAG0437 


123 


hypothetical protein 


ORF00493 


SAG0438 


145 


bacteriophage L54a, integrase, truncation 


ORF00495 


SAG0439 




conserved hypothetical protein, FRAMESHIFT 


ORF00496 


SAG0440 


84 


conserved hypothetical protein 


ORF00497 


A jfe A A -A 

SAG0441 


103 


conserved domain protein 


ORF00499 


SAG0442 


189 


acetyltransferase, GNAT family 


ORF00500 


SAG0443 


194 


acetyltransferase, GNAT family 


ORF00501 


SAG0444 


188 


conserved hypothetical protein 


ORF00502 


SAG0445 


883 


valyi-tRNA synthetase 


ORF00503 


SAG0446 


319 


oxidoreductase, Gfo/ldh/MocA family 


ORF00504 


SAG0447 


287 


magnesium transporter, CorA family 


ORF00506 


SAG0448 


391 


transposase, IS256 family 


ORF00507 


SAG0449 


354 


conserved hypothetical protein 


ORF00508 


SAG0450 


330 


aspartate— ammonia ligase 


ORF00510 


SAG0451 


149 


bacteriocin transport accessory protein, putative 


ORF00511 


SAG0452 


179 


type II DNA modification methyltransferase, putative 


; ORF00512 


SAG0453 


A*\ A*\ 

96 


hypothetical protein 


ORF00513 


SAG0454 


161 


t i t j i * li it r 

phosphopantetheine adenylyltransferase 


ORF00515 


SAG0455 


pi «M> 

357 


conserved hypothetical protein 


ORF00518 


SAG0456 




11 1 1 A • 1 1* 1 I 

conserved hypothetical protein, degenerate 


ORF00519 


SAG0457 


192 


conserved hypothetical protein 


ORF00520 


SAG0458 


368 


conserved hypothetical protein TIGR00048 


ORF00521 


SAG0459 


A taMB A 

171 


VanZF domain protein 


I ORF00522 


SAG0460 


581 


ABC transporter, ATP-binding/permease protein 


ORF00523 


SAG0461 


579 


A Af^ M ■ A f i 1 • t a a 

ABC transporter, ATP-binding/permease protein 


ORF00524 


SAG0462 


188 


anthranilate synthase component II 


ORF00525 


SAG0463 


179 


bioY family protein 


ORF00526 


SAG0464 


/*V 

330 


biotin synthetase 


«te. mb ™P 

ORF00527 


SAG0465 


164 


hypothetical protein 


ORF00528 


SAG0466 


371 


thiolase 


ORF00531 


SAG0467 


409 


AMP-binding enzyme domain protein 


ORF00532 


SAG0468 


210 


endonuclease III 


ORF00533 


SAG0469 


131 


type IV prepiiin peptidase-related protein 


ORF00534 


SAG0470 


69 


conserved hypothetical protein 


ORF00535 


SAG0471 


322 


glucokinase 


ORF00536 


SAG0472 


126 


rhodanese domain protein 


ORF00537 


SAG0473 


613 


elongation factor Tu family protein 


ORF00538 


SAG0474 


81 


conserved hypothetical protein 


ORF00540 


SAG0475 


451 


UDP-N-acetylmuramoylalanine-D-glutamate ligase 
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ORF00541 


O A <^>n A "7 

SAG0476 


358 


UDP-N-acetylglucosamine--N-acetylrnuramyl- 
^pentapeptiae; pyropnospnoryi-unaecaprenoi in- 

PiPPtvlnliir*nQamin<^ tran^ff^r^Qfs 
o^c Ly ly i ULiUodi I ill ic. li ai ioici aoc 


ORF00S4? 


SAG0477 


^78 




ORF00544 


SAG0478 


4?9 


ppll Hiv/iciion nrnti^in Ft^A 




SAG0479 




LrCSII UIVIolUII pivJLcHI riot. 


ORF00546 


SAG0480 


224 


yimE protein, putative 


UKrUU04/ 


oAbU4o 1 




yimh protein 


UKrUUo4o 


oA(jU4o^ 


CM 

o4 


Y(jCj i tamny protein 


UKrUUo4y 


Q A^TM QO 

oA(jU4oo 


oco 
ZDZ 


yimn protein 


UKrUUOOU 


o a r»n>i qa 
oAoU4o4 


occ 

2ob 


cell division protein DivlVA, putative 


URrUUOO*: 


O A f\ A OC 

oACjU4oo 


you 


isoleucyl-tRNA synthetase 


ORF00553 


SAG0486 


100 


conserved hypothetical protein 


ORF00554 


SAG0487 


151 


MutT/nudix family protein 


ORF00555 


SAG0488 


753 


ATP-dependent Clp protease, ATP-binding subunit 


ORF00556 


SAG0489 


34 


hypothetical protein 


ORF00557 


SAG0490 


76 


conserved hypothetical protein 


ORF00558 


SAG0491 


230 


amino acid ABC transporter, permease protein 


ORF00559 


SAG0492 


244 


amino acid ABC transporter, ATP-binding protein 


ORF00560 


SAG0493 


564 


phosphoglucomutase/phosphomannomutase family 
protein 


ORF00562 


SAG0494 


284 


kit if t i^ii 

methylenetetrahydrofolate 
aenydrogenase/metnenyltetrahydrofolate 
cycionyuroiase 


ORFnn^R'* 






tfUn&erveo nypoineitcai proiein 


ORFfln^fid 
ur\ruuuo < T 






exuueoxynuonuciease vn, targe suDunn 


UixrUuOUJ 


0/"\OVJ**57 1 


71 


tiAuucUAyi \w\ fuLriccjbfcj vii, smaif suounii 


ORFnn^fifi 

\Jr\rUUJUO 




£.571/ 


y tJi cii ly in di ion dnbici doc, puiaiivo 




^AG04QQ 




h om/~il\/c in A 1 
1 Icl 1 HJiyoll 1 r\ 


ORFOfVSfift 




1S7 


ctiyilllllc ICjJIcooUl r\\ y r\, puidLlvc 


ORF00*>70 
urvruuv// w 


SAGO^OI 






ORF00571 




?7R 


n<an\/ familv nrr»toin 
ucy v ictiiiiiy piuicMii 






97Q 


1 in3Co/A/ n> \/lh\/r(rolacia ni itat\\/a 
LI[Jdoc/r\^yil lyui <Jld53ti, {JUldilvc 




SAGnsn4 


POD 


bUIIOCI VCU liypUUICUl/al JJILKfcZIII 


ORF00574 




Q1 

57 i 


r^M A-hinHinn r» roto i n Wl 1 
L/iNrA*"uii luii ly piuitJiii nvj 


ORF00*575 t 




RS 


J ly^JUtl ICLlL'al pioiciii 


ORFOO C 576 ' 




O I w 


Hi h\/H mn rntato Hohv/Hrnnonaco A 
Ull lyuivJUI UlaLc UtJI ly Ul uy til laot! r\ 


ORF00*i77 i 




41 1 


Ucld IduLdl 1 1 IcoioldllUc IdlAUl 


ORFnn^7fl 






Ucld"ldOldi 1 1 1 c?olotdl loo IdUlUI 


1 ORF00^7Q 




40R 


miirKvl rvrntoin niitati\/fi 

IIIUIIVI plUlCII 1, pULaLiVt? 


ORF00580 


SAG0511 


270 


hydrolase, haloacid dehalogenase-like family 


UKrUUOol 




4oo 


mu domain protein 


ORF00582 


SAG0513 


128 


conserved hypothetical protein 


ORF00583 


SAG0514 


894 


cation-transporting ATPase, E1-E2 family 


ORF00584 


SAGOS 15 


286 


conserved hypothetical protein 


ORF00585 


SAGOS 16 


643 


fructose-1 ,6-bisphosphatase, putative 


ORF00586 


SAG0517 


374 


iron-sulfur cluster-binding protein, putative 


ORF00587 


SAG0518 




peptide chain release factor 2, FRAMESHIFT 


ORF00588 


SAG0519 


230 


cell division ABC transporter, ATP-binding protein FtsE 


ORF00589 


SAG0520 


309 


cell division ABC transporter, permease protein FtsX 


ORF00590 


SAG0521 


236 


carboxymethylenebutenolidase-related protein ™ 


ORF00591 


SAG0522 


232 


metallo-beta-lactamase superfamily protein 
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ORF00592 


SAG0523 


254 


oxidoreductase, short chain dehydrogenase/reductase 
family 


OKrQQ593 


SAG0524 


835 


DNA polymerase Hi, epsilon subunit/ATP-dependent 
heiicase DinCj 


\Jr\rUUOyO 


OAoUOZO 




aspartate aminotransrerase 


UKruuoyo 




A AO 

44 0 


asparaginyi-tKNA synthetase 


ORF00597 


SAG0527 


185 


conserved hypothetical protein 


ORF00598 


SAG0528 


327 


mosine-undme preferring nucleoside hydrolase 


ORF00599 


SAG0529 


38 


hypothetical protein 


ORF00600 


SAG0530 


137 


OsmC/Ohr family protein 


ORF00601 


SAG0531 


296 


conserved hypothetical protein 


ORF00602 


SAG0532 


324 


conserved hypothetical protein 


ORF00603 


SAG0533 


303 


Uncharacterized BCR, COG1481 


ORF00604 


SAG0534 


465 


dipeptidase 


ORF00605 


SAG0535 


506 


zinc ABC transporter, zinc-binding adhesion iiprotein 


UKrUUbUb 


oACsUDob 


86 


ribosomal protein L31 


ORF00607 


SAG0537 


311 


DHH family protein 


ORF00608 


SAG0538 


340 


adenosine deaminase, putative 


ORF00609 


SAG0539 


147 


flavodoxin 


/-~\ n r""/"\/*\ o H /-» 

ORF00610 


SAG0540 


91 


chonsmate mutase, putative 


OKF0061 1 


SAG0541 


398 


voltage-gated chloride channel family protein 


ORF00612 


SAG0542 


127 


IS1381, transposase OrfA 


ORF00613 


SAG0543 


129 


IS1381, transposase OrfB 


ORF00614 


SAG0544 


1 15 


ribosomal protein L19 


ORF00615 


SAG0545 


359 


site-specific recombinase, phage integrase family 


Ur\ruUO 1 / 


0noU04D 


D/ 


conservea aomam protein 


UrxrUUO IO 




1<30 


nypotneticai protein 


UrxrUUD Itf 




ZOO 


repressor protein, putative 




QAfTsn^/tQ 


4 f 


nypotneticai protein 


ORFnOfi91 
UrxrUUvZ 1 


OMoUOOU 


/ 4 


conservea nypotneticai protein 


orphhroo 

VJr\rUUOa£«£ 


OMoUODl 


oz 


conservea nypotneticai protein 


UixrUUOto 


oAvjUOOZ 


DZ 


nypotneticai protein 


UfxrUUD&'r 


OnbUODj 




nypotneticai protein 




oAoUOD4 


DO 


transcriptional regulator, Oro/Cl tamily 


UrxrUUDt ( 




^4y 


antirepressor, putative 






A "7 
4/ 


nypotneticai protein 






f b 


nypotneticai protein 




oAbUOOO 


-7 A 

/4 


l^a. % a 1 4 1- apaaL 4* a ^a^ ^a^ ( a^a& h* j ' ^ 4t a ak 

nypotneticai protein 




oAbuooy 


Zob 


conservea nypotneticai protein 




CSAoUODU 


( ( 


conservea nypotneticai protein 


vJr\rUUDOO 


oAoUOD 1 


4D 


U"a % ^"4^ t*a aaav aar «-t_ l-.n_ a a«aa 

nypotneticai protein 


UKrUUDoD 


oAoUOD^i 


o4 


nypotneticai protein 


ORF00637 


SAG0563 


53 


1 t a* 

hypothetical protein 


ORF00638 


SAG0564 


160 


conserved hypothetical protein 




OAOUODO 




conservea aomam protein 


ORF00640 


SAG0566 


138 


single-strand binding protein 


ORF00641 


SAG0567 


439 


reverse transcriptase/maturase family protein 


ORF00642 


SAG0568 


67 


conserved hypothetical protein j 


ORF00643 


SAG0569 


158 


conserved hypothetical protein 


ORF00644 


SAG0570 


115 


hypothetical protein 


ORF00645 


SAG0571 


43 


hypothetical protein 


ORF00646 


SAG0572 


138 


conserved hypothetical protein 


ORF00647 


SAG0573 


54 


hypothetical protein 
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ORF00o4o 


SAG0574 


89 


conserved hypothetical protein 


ORF00649 


SAG0575 


110 


hypothetical protein 


ORF00650 


SAG0576 


43 


• lit* 1 « * 

hypothetical protein 


ORF00652 


SAG0577 


177 


conserved hypothetical protein 


ORF00653 


SAG0578 


88 


conserved hypothetical protein 


ORF00654 


SAG0581 


118 


conserved hypothetical protein 


ORF00655 


SAG0582 


422 


conserved hypothetical protein 


ORF00656 


SAG0583 


406 


conserved hypothetical protein 


A M If* 

ORF00657 


SAG0584 


62 


conserved hypothetical protein, truncation 


ORF00658 


SAG0585 


471 


conserved hypothetical protein 


! ORF00659 


SAG0586 


154 


conserved hypothetical protein 


ORF00660 


SAG0587 


300 


structural protein, putative 


ORF00661 


SAG0588 


71 


conserved hypothetical protein 


ORF00662 


SAG0589 


143 


conserved hypothetical protein 


ORF00663 


SAG0590 


112 


conserved hypothetical protein 


ORF00664 


SAG0591 


78 


conserved hypothetical protein 


ORF00665 


SAGO 592 


111 


conserved hypothetical protein 


ORF00666 


SAGO 593 


185 


structural protein 


ORF00667 


SAG0594 


81 


conserved hypothetical protein 


ORFOOooo 


SAG0595 


123 


conserved hypothetical protein 


ORF00669 


SAG0596 


670 


PblA, internal deletion 


ORrOOo/O 


SAG0597 


506 


minor structural protein, putative 


ORF00671 


SAG0598 


1374 


minor structural protein, putative 


ORF00672 


SAGO 599 


f> f% 

668 


minor structural protein, putative 


ORF00673 


SAG0600 


109 


hypothetical protein 


ORF00674 


SAG0601 


70 


hypothetical protein 


ORF00675 


SAG0602 


100 


conserved hypothetical protein 


ORF00676 


SAG0603 


111 


holin, putative 


ORF00677 


SAG0604 


239 


lysin, putative 


ORF00678 


SAG0605 


323 


conserved hypothetical protein 


ORF00679 


SAG0606 


66 


conserved hypothetical protein 


ORF00681 


SAG0607 


56 


conserved hypothetical protein 


ORF00682 


SAG0608 


59 


hypothetical protein 


ORF00683 


SAG0609 


193 


site-specific recombinase, phage integrase family 

r 


ORF00685 


SAG0610 


A A 

134 


conserved hypothetical protein 


ORF00687 


SAG0611 




transposase, degenerate FRAMESHIFT 


ORF00689 


SAG0612 


53 


conserved hypothetical protein, FRAMESHIFT 


ORF00690 


SAG0613 


425 


transmembrane protein Vexpl 


ORF00691 


SAG0614 


218 


ABC transporter, ATP-binding protein Vexp2 


ORF00692 


SAG0615 


458 


transmembrane protein Vexp3 


ORF00693 


SAG0616 


217 


DNA-binding response regulator VncR 


ORF00694 


SAG0617 


439 


sensor histidine kinase VncS \ 


ORF00695 


SAG0618 


195 


transposase OrfB, IS3 family, truncation 


ORF00697 


SAG0619 


66 


conserved hypothetical protein 


ORF00698 


SAG0620 


62 


hypothetical protein 


ORF00699 


SAG0621 


a r\A 

401 


rod shape-determining protein RodA, putativeD 


ORF00700 


SAG0622 


186 


hydrolase, haloacid dehalogenase-like family 


ORF00701 


SAG0623 


650 


DNA gyrase, B subunit 


ORF00702 


SAG0624 


574 


septation ring formation regulator EzrA, putative 


ORF00703 


SAG0625 


213 


phosphoserine phosphatase SerB 


ORF00704 


SAG0626 


161 


MutT/nudix family protein 


ORF00705 


SAG0827 


151 


conserved hypothetical protein 


ORF00706 


SAG0628 


435 


enolase 


ORF00707 


SAG0629 


354 


conserved domain protein 
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ORF00708 ! 


SAG0630 

\_//\\^ w w w w 


427 


^-nhosDhoshikimate 1 -carboxwinvltransferase 


t ORF00709 


SAG0631 


170 

1 * w 


shikimate kinase 


nRF007in * 

ur\ruu i iu i 


SAG0632 


457 


n^r nrotftin 

UOI W 1 w L w Hi 


ORF0071 1 ! 


SAG0633 


451 


RNA methvitransf erase. TrmAfamilv 

(XI* i \ III X^* M IJ > ■ ■ ■ f » till/ VI V— < 1 f lllj 


ORF00712 


SAG0634 


70 


hypothetical protein 


| OKrUU/lo | 


oAbUOoO 




^-•/-•SW nhftenhataco nror"i irc/^r* />|qcc R 
aCIQ pnUopUcUaofcJ piCLrUloUr, Utdbb O 


j OKrOO/14 | 


oACjUDOD 




conserveci nyjjuuicriiocii piumifi 


ORF00717 


oAvjUDO f 




transcnpnonai reguiaior, i etr\ ramuy, puiauve, 


[ UKrUU/ IO 1 


OMOUOOO 




r*#all wall «;i lrfepp anphnr familv nrntpin 

OCI 1 Weill Out iaoc en il>i iui i ai 1 1 ity yj 1 1_/ ici 1 1 


j {JKrUU/ZU j 






trancnn<;pc;p OrfR IS'-l f?*mil\/ 

lidt loUUoaoC \— / 1 I t — > , IwvJ iciiiiiiy 


P f*M3CTAn79*1 1 
| UKrUU/ 1 j 


onoUD*fu 


CM 1 


tranennenQp OrfA f^mil\/ 
LI cX\ loUUoaoc \S\ iww laiiiiiy 








Tn*^^? Orf 10 nrnfpin dpnpnpratp- POINT MUTATION 


ORF00723 


SAG0642 


59 


hvDotheticai protein 


1 ORF00725 


SAG0643 

W/ ^W w^ w^^ v/ 




chaDeronin. 33 kDa DEGENERATE 


1 ORF00726 


SAG0644 


402 


transcriptional regulator, AraC family 


ORF00727 ! 


SAG0645 


554 


cell wall surface anchor family protein, putative 


I ORF00728 

1 V_/ l X i WW 1 A-.W 1 


SAG0646 


307 


cell wall surface anchor family protein j 


1 ORF00729 1 

I x-/ 1 XI WW i <«w 1 


SAG0647 


305 


sortase family protein 


1 ORF00731 i 

V*/ I XI W W i V/ 1 \ 


SAG0648 


260 


sortase family protein 


ORF00732 1 


SAG0649 


890 


cell wall surface anchor family protein, putative 


\ ORF00734 ! 

I \mJl XI WW 9 \J^T 


SAG0650 


189 


sortase family protein, FRAMESH1FT 


I ORF00735 ! 

% V/l \1 WW ■ wx<J 


SAG0651 


201 


hypothetical protein 


! ORF00737 1 


SAG0653 


76 


conserved hypothetical protein, DEGENERATE 


ORF00738 

K*/ 1 XI \J\J t WW 


SAG0654 


34 


hypothetical protein 


1 ORF00740 


SAG0656 


36 


hypothetical protein 


ORF00741 

V— «* P XI WW • 1 


SAG0657 


89 


hypothetical protein 


S ORF00742 

u »w* 1 XI WW 1 


SAG0658 


383 


lipoprotein, putative 


1 ORF00743 

i V-/ 1 XI wW f » 


SAG0659 


330 


ABC transporter, ATP-binding protein 


ORF00744 

■ Xa** • xi ww^ ■ ■ r 


j SAG0660 


272 


membrane protein j 


ORF00745 

1 x»^" t xi W J # x*» 


| SAG0661 


261 


conserved hypothetical protein 


1 ORF00747 

I x»# i xi w w ■ r * 


SAG0663 


282 


cylD protein 


ORF00748 

1 1 XI W W f • V 


SAG0664 


240 


cylG protein 


ORF00749 

1. V^/ | XI WW* 1 ^# 


SAG0665 


101 


acyl carrier protein AcpC j 


! ORF00750 

| >— ' J XI WW t ww^ 


1 SAG0666 


158 


cylZ protein FRAMESH1FT 


ORF00751 


| SAG0667 


309 


cylA protein 


1 ORF00752 

1 Xbp* 1 XI W W f X**afc» 


j SAG0668 


292 


cylB protein 


ORF00753 

1 X| WW f WW 


| SAG0669 


667 


cylE protein j 


ORF00754 


j SAG0670 


317 


cylF protein 


1 ORF00755 


SAG0671 


731 


cyll protein 

aF 1 


1! ORF00756 

| ft xi x/ r n«p 


I SAG0672 


403 


cylJ protein 


ORF00757 

^bp* 1 XI V/ W | W' 1 


I SAG0673 


191 


cylK protein 


3 ORF00758 


I SAG0674 


113 


hypothetical protein 


ORF00759 


SAG0675 


171 


surface protein antigen-related protein 


j ORF00760 


j SAG0676 


ooc 
OOO 


serine protease, suDtiiase ramuy, putauve 


| vJKrUU/oT 




1062 


hvnothetical orotein 

I i y uwu i w ti ww«i i^i w wwi 1 1 


ORF00762 


SAG0678 




endopeptidase O DEGENERATE 


ORF00766 


SAG0679 


286 


hydrolase, alpha/beta fold family, putative 


ORF00767 


SAG0680 


339 


hypothetical protein 


ORF00768 


SAG0681 


353 


conserved domain protein 


ORF00769 


SAG0682 


409 


permease, putative 


ORP00770 


SAG0683 




transmembrane protein Vexp3, putative FRAMESHIFT 


j ORF00774 


I SAG0684 


223 


ABC transporter, ATP-binding protein 
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URr00775 


o a r>ncoc 
oACjUdoo 


472 


conserved nypotneticai protein 


ORr00776 




261 


UNA-entry nuclease, putative 


ORF00777 


SAG0687 


OH O 

212 


uedA family protein, putative 


ORF00778 


SAG0688 


218 


ABC transporter, ATP-binding protein 


jM^. ■ ma, B^H - - 

ORF00779 


SAG0689 


257 


membrane protein, putative 


ORF00780 


SAG0690 


272 


conserved hypothetical protein 


ORF00781 


SAG0691 


294 


transcnptional regulator, LysR family 


ORF00783 


SAG0692 


193 


regulatory protein, putative 


ORF00785 


SAG0693 


377 


IS1548, transposase 


ORF00786 


SAG0694 


173 


regulatory protein, putative, truncation 


ORF00787 


SAG0695 


330 


D-lactate dehydrogenase 


ORF00788 


SAG0696 


516 


sodiumrgalactoside symporter family protein, putative 


ORF00789 


SAG0697 


341 


2-keto-3-deoxygluconate kinase 


ORF00790 


SAG0698 


599 


beta-glucuronidase 


ORF00791 


SAG0699 


223 


transcriptional regulator, GntR family 


ORF00792 


SAG0700 


205 


2-dehydro-3-deoxyphosphogluconate aldolase/4- 
hydroxy-2-oxoglutarate aldolase 


i ORF00793 


SAG 0701 


466 


Glucuronate isomerase 


ORF00794 


SAG0702 


O A O 

348 


mannonate dehydratase 


ORF00795 


oACaO f Oo 


279 


D-mannonate oxidoreductase 


ORF00796 


SAGU704 


OTA 

270 


hydrolase, naloacid dehalogenase-like family 


ORFQ0797 


SAG0705 


596 


glycosyl hydrolase, tamily o 


AHCAA7AO 

ORF00798 


oACjU / Ob 


obi 


proline dipeptidase 


ADCAATAA 

ORF00799 




oo4 


transcriptional regulator, KegM tamily 


AOCAAOAA 

UKrUUoUO 


oACj070o 


4oo 


alpha amylase family protein 


ORF00801 


SAG0709 


332 


glycosyl transferase, group 1 family protein 


ORF00802 


SAG0710 


Add 

X T' T 


glycosyl transferase, group 1 family protein 


ORF00803 


SAG071 1 


647 


threonyl-tRNA synthetase 


a"*»k v*bbl jp»> a**i a** al 

ORF00804 


a**a» ai .a*"*. *V a* a**k 

SAG0712 


234 


a. h I at 1 * 1 • It 

DNA-binding response regulator 


ORF00805 


SAG0713 


339 


conserved hypothetical protein 


a*»*. ^P^a* aaaa *»k a*k **a a*V 

ORF00806 


a"fc A > 1 a^ J * 

SAG0714 


188 


Ik Ml 1* ■ i » 

conserved hypothetical protein 


ORF00807 


SAG0715 


216 


V- • 1 S aai aa. a ■ a • 

amino acid ABC transporter, permease protein 


ORF00808 


SAG071 6 


231 


amino acid ABC transporter, permease protein 


ORF00809 


SAG0717 


266 


amino acid ABC transporter, amino acid-binding 
protein 


ORF00810 


SAG0718 


o rr 4 

251 


amino acid ABC transporter, ATP-binding protein 


. ORF00811 


SAG0719 


236 


DNA-binding response regulator 


ORF00812 


SAG0720 


A Af\ 

449 


sensory box histidine kinase 


ORF00813 


SAG0721 


269 


metallo-beta-lactamase family protein 


ORF00814 


SAGO 722 


jA\ r\ r\ 

122 


conserved hypothetical protein 


ORF00815 


SAGO 723 


AO A 

236 


ribonuclease III 


ORF00816 


SAG0724 


A A *TA 

1 179 


SMC family protein 


ORF00817 


SAG 0725 


265 


hydrolase, haloacid dehalogenase-like family 


ORF00818 


SAG0726 


274 


hydrolase, haloacid dehalogenase-like family 


ORF00819 


SAG0727 


536 


signal recognition particle-docking protein FtsY 


ORF00820 


SAG0728 


270 


ABC transporter, substrate-binding protein 


ORF00821 


SAG0729 


300 


ABC transporter, permease protein, putative 


ORF00822 


SAG0730 


42 


ABC transporter, ATP-binding protein 


ORF00823 


SAG0731 


347 


bacterial luciferase family protein 


ORF00824 


SAG0732 


720 


transcriptional accessory protein Te*, putative 


ORF00825 


SAG0733 


142 


conserved hypothetical protein 


ORF00826 


SAG0734 


87 


phage shock protein C, putative 


ORF00827 


SAG0735 


44 


hypothetical protein 


ORF00828 


SAG0736 


311 


HPr(Ser) kinase/phosphatase 
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ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF00830 


SAG0737 




prolipoprotein aiacyigiyceryi transrerase 


/~\r"i r~ a aoo o 

ORF00832 


O A AA"70 O 

SAG0738 




conserveo nypoineucai proiem 


ORF00833 


SAGO/39 


14o 


conserved nypoineucai protein 


ORF00834 


SAG0740 


al 


conservea nypoineuucii proiem 


ORF00835 


O A AT it A 

SAG0741 


oUo 


peptiaase, uoz Tamiiy, putaiive 


• ORF00836 


SAG0742 


4^8 


peptidase, uoz Tamny 


/*^n^AAoo^ 

; ORF00837 


O A AA7^ O 

oAGQ74o 


-7 A 
/0 


conserveo nypoineucai protein 


ORF00838 


O A /"» AT >4 /< 

SAGO 744 


265 


memorane protein, putative 


I - /""V ^> f\ f»% /~v 

ORF00839 


SAG0745 


446 


ivin 1 iC+/rezi+ transporter, nkaivik Tamiiy 


ORF00840 


SAG0746 i 


O O A 

369 


nDotiavin Diosyntnesis protein kidu 


ORF00841 


SAG0747 


AAO 

208 


nDoiiavin syntnase, aipna suDunit 


ORF00842 


SAG0748 


OAT 

397 


nDOTiavin Diosyntnesis protein kida 


ORF00843 


SAG0749 


156 


nDotiavin syntnase, oeta suounit 


ORF00844 


SAG0750 


A AC 

496 


»ysyi-tKiNA syntnetase 


ORF00845 


SAG0751 


OA A 

300 


nyaroiase, naioacia uenaiogenase-uKe Tamny 


ORF00846 


SAG0752 


OHO 1 

213 


pnospnogiycerate mutase Tamny protein 


ORF00847 


oAGO / So 




eosu Tamny protein, putative 


ORF00848 


oAGOY 04 


one 


conservea aomain protein 


QRrUOooU 


oAbl) f OO 




peptioase, Uoz Tamny 


UKrUUooZ 


O A An7CC 


A 7 A 


conservea nypoineucai pruicjin 


OKrUUoOd 


oAv^U /Of 




lipoprotein, puiauvt? 


UKrUUoOD 


onbU f Oo 


CQQ 

oyy 


AlinnAnHAnanl'iHciCO P Al if oti\ id 

oiiyoenaopcpuucioc r, puLcaiivc 


ORrOuoob 


oAbU f oy 




pnospnociioipyruvdio cai uuxyjdot? 


ORF00857 


SAG0760 


Off 


I01548, transposase 


ORF00859 


SAG 0761 


A OO 

422 


cell division protein, rtsw/KoaA/opov t tamiiy 


ORF00861 


SAG0762 


/"V A A 

398 


translation elongation factor Tu 


ORF00863 


v A /*| A"TArt 

SAG0763 


252 


triosephosphate isomerase 


ORF00865 


SAG0764 


230 


phosphoglycerate mutase 


ORF00866 


SAG0765 


681 


penicillin-binding protein 2b 


ORF00867 


SAG0766 


198 


recombination protein RecR 


ORF00868 


SAG0767 


348 


D-alanine— D-alanine ligase 


a*to j»% 

ORF00869 


SAG0768 


455 


UDP-N-acetylmuramoylalanyl-D-glutamyl-2,o- 








aiammopimeiate— u-aianyi-u-aianyi ngase 


ORF00870 


SAG0769 


406 


oxalate:formate antiporter 


ORF00871 


SAG0770 


228 


conserved hypothetical protein 


ORF00872 


SAG0771 


512 


cell wall surface anchor family protein 


ORF00873 


SAG0772 


^— J At 

514 


peptide chain release factor 3 


ORF00874 


SAG0773 


126 


conserved hypothetical protein 


ORF00876 


SAG0774 


A\ At 

\ 244 


ABC transporter, ATP-binding protein 


ORF00878 


SAG0775 


220 


ABC transporter, permease protein 


ORF00879 


SAG0776 


276 


Hpo protei n .putative 


ORF00880 


SAG0777 


528 


ATP-dependent RNA helicase, DEAD/DEAH box 








family 


ORF00882 


SAG0778 


88 


II ale* I J ' 

conserved hypothetical protein 


ORF00883 


SAG0779 


254 


conserved hypothetical protein 


ORF00884 


SAG0780 


246 


acyltransferase family protein 


ORF00885 


SAG0781 


217 


competence protein CelA 


ORF00887 


SAG0782 


745 


DNA internalization-related competence protein 








ComEC/Rec2 


ORF00888 


SAG0783 


269 


hydrolase, haloacid dehalogenase-like family 


ORF00889 


SAG0784 


314 


sugar-binding transcriptional regulator, Lacl family 


ORF00600 


SAG0785 


330 


oontarvsd hypothetical protein 


ORF00891 


SAG0786 


242 


conserved domain protein 


ORF00892 


SAG0787 


345 


DNA polymerase III, delta subunit, putativeD 
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ORF00893 


SAG0788 


202 


superoxide aismutase, he-Mn 


ORF00894 


SAG0789 


ooo 

283 


transcriptional antiterminator Lici 


ORF00895 


SAG0790 


622 


PTS system, beta-glucosides-specific II ABC 
components 


ORF00896 


SAG0791 


A ~7 C 

475 


o-pnospno-Deta-giucosiaase 


ORF00898 


A <•"■» O"»f\0 

SAG0792 


364 


conserved nypotneticai protein 


ORF00899 


SAG0793 


O O f\ 

380 


conserved nypotneticai protein TioKUUU4o 


ORF00900 


SAG0794 


A A O 

418 


permease, GntP family 


r*"\ r~ r-x /-x jt*\ 

ORF00902 


SAG0795 


354 


conserved hypothetical protein 


ORF00903 


SAG0796 


147 


transcriptional regulator, MarR family 


ORF00904 


SAG0797 


342 


S-adenosylmethionine:tRNA ribosyltransf erase- 
isomerase 


c\ o cr r\ r\ t\ r\ c 

ORruU9U5 


oAoU / y O 




memorane proxein, putative 


ORF00906 


oACaU/yy 


Zoo 


/ill inrkoornlnrv G v-\ t-» ,tn r*\ ^ *~*s \ s~\ S^ArviAi*loA 

giucosamine-o-pnospnate isomerase 


ORF0Q9Q7 


oAGUoUU 


olo 


vjiutatnione o-transrerases domain protein 


ORF00908 


SAGUoUl 


zoy 


riDosomai small suDunit pseucouricine syntnase 


ORF00909 


c» a A/iono 
SAGOoUl 


OO 


nypotneticai protein 


UKrUUy 1U 


oAoUoUo 


OOO 


major Taciuiator Tamiiy protein 


UKrUUyi 1 


oACaUoU4 


olO 


compeience protein ooim 


(JKrUUy \£. \ 


O A/*^AQAC 

oAoUoUO 




oiigoenuopeptiaase d 


UKrUUyio 1 


oAoUoUo 




nyaroidse, riaioacia uencjioyeriaotr-iifve idrniiy 


(JKrUUyi4 


oAoUoU f 


zoo 


w-meinyiiransTerase Tamtiy protein 


UKrUUyio 


oAvjUoUo 


ouy 


protease maturation protein, puiauve 


UKrUUyio 


OAAnftfiQ 

oAbUoUs 


I O I 


conberveo nypoineticai protein 


ORF00919 


SAG0810 


872 


alanyl-tRNA synthetase 


ORF00921 


SAG081 1 


238 


membrane protein, putative 


ORF00922 


SAG0812 


272 


glycosyl transferase, family 8 


ORF00923 


SAG0813 


81 


hypothetical protein 


ORF00924 


SAG0814 


95 


conserved domain protein 


ORF00925 


SAG0815 


71 


transcriptional regulator, Cro/CI family 


ORF00926 


SAG0816 


253 


conserved hypothetical protein 


ORF00927 


SAG0817 


187 


conserved hypothetical protein 


ORF00928 


SAG0818 


319 


ribonucleoside-diphosphate reductase 2, beta subunit 


ORF00929 


SAG0819 


719 


ribonucleoside-diphosphate reductase 2, alpha subunit 


ORF00930 


SAG0820 


74 


ribonucleoside-diphosphate reductase 2, NrdH-redoxin 


ORF00931 


SAG0821 


87 


phosphocarrier protein HPr 


ORF00932 


SAG0822 


577 


phosphoenolpyruvate-protein phosphotransferase 


ORF00933 


SAG0823 


475 


■ lilt y~\ 1 | t . • 1 . 1 1 1 Jl ■ — v 

glyceraldehyde-3-phosphate dehydrogenase, NADP- 
dependent 


ORF00934 


SAG0824 


417 


polysaccharide deacetylase tamiiy protein 


ORF00935 


SAG0825 


360 


ATP-dependent RNA helicase, DEAD/DEAH box 
family 


ORF00936 


SAG0826 


209 


uridine kinase 


ORF00937 


SAG0827 


1 DO 


uuiioeivcu i lyjjvjii itJiioai piuidii 


ORF00938 


SAG0828 


554 


DNA polymerase III, gamma and tau subunits 


ORF00939 


SAG0829 


64 


conserved hypothetical protein 


ORF00940 


SAG0830 


311 


biotin— acetyl-CoA-carboxylase ligase 


ORF00941 


SAG0831 


398 


S-adenosylmethionine synthetase 


ORF00942 


SAG0832 


753 


hypothetical protein 


ORF00943 


SAG0833 


181 


hypothetical protein 


ORF00944 


SAG0834 


42 


hypothetical protein 


ORF00945 [ SAG0835 


188 


conserved hypothetical protein 
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ORF00946 


SAG0836 


184 


conserved hypothetical protein 


ORF00948 


SAG0837 


428 


ABC transporter, ATP-binding protein 


ORF00950 


SAG0838 


233 


hypothetical protein 


ORF00951 


SAG0839 


226 


transcriptional regulator, TenA family 


ORF00952 


SAG0840 


265 


phosphomethylpyrimidine kinase 


ORF00953 


SAG0841 


256 


hydroxyethylthiazole kinase 


ORF00954 


SAG0842 


223 


thiamine-phosphate pyrophosphorylase 


ORF00955 


SAG0843 


419 


UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


ORF00956 


SAG0844 


184 


acetyltransf erase, GNAT family 


ORF00957 


SAG0845 


427 


CBS domain protein 


ORF00958 


SAG0846 


286 


methionine aminopeptidase, type I 


ORF00959 


SAG0847 


306 


ribonuclease BN, putative 


ORF00961 


SAG0848 


151 


GtrA family protein 


ORF00962 


SAG0849 


169 


conserved hypothetical protein 


ORF00963 


SAG0850 


652 


DNA ligase, NAD-dependent 


ORF00964 


SAG0851 


339 


BmrU protein, putative 


ORF00966 


SAG0852 


766 


puliulanase, putative 


ORF00967 


SAG0853 


622 


1 ,4-alpha-glucan branching enzyme 


ORF00968 


SAG0854 


379 


glucose-1 -phosphate adenyiyltransferase 


ORF00969 


SAG0855 




glycogen biosynthesis protein GlgD FRAMESHIFT 


ORF00971 


SAGO 856 


476 


glycogen synthase 


ORF00972 


SAG0857 


66 


ATP synthase F0, C subunit 


ORF00973 


SAG0858 


238 


ATP synthase F0, A subunit 


ORF00974 


SAG0859 


165 


ATP synthase F0, B subunit 


ORF00975 


SAG0860 


178 


ATP synthase F1, delta subunit 


ORF00976 


SAG0861 


501 


ATP synthase F1, alpha subunit 


ORF00977 


SAG0862 


293 


ATP synthase F1, gamma subunit 


ORF00978 


SAG0863 


468 


ATP synthase F1, beta subunit 


ORF00979 


SAG0864 


137 


ATP synthase F1 , epsilon subunit 


ORF00980 


SAG0865 


76 


conserved hypothetical protein 


ORF00981 


SAG0866 


423 


U D P-N-acety Ig I ucosam i n e 1 -carboxyvi nyltransf erase 


ORF00982 


SAG0867 


63 


conserved hypothetical protein 


ORF00983 


SAG0868 


285 


DNA-entry nuclease 


ORF00984 


SAG0869 


346 


phenylalanyl-tRNA synthetase, alpha subunit 


ORF00985 


SAG0870 


173 


acetyltransf erase, GNAT family 


ORF00986 


SAG0871 


801 


phenylalanyl-tRNA synthetase, beta subunit 


ORF00987 


SAG0872 


300 


conserved hypothetical protein 


ORF00988 


SAG0873 


1077 


exonuclease RexB 


ORF00989 


SAG0874 


1207 


exonuclease RexA 


ORF00990 


SAG0875 


305 


magnesium transporter, CorA family, putative 


ORF00991 


SAG0876 


458 


tRNA modification GTPase TrmE 


ORF00992 


SAG0877 


636 


ABC transDorter ATP-bindina Drotein 


ORF00993 


SAG0878 


322 


acetoin dehvdroaenase thvmine PPi dependent E1 
component, alpha subunit 


ORF00994 


SAG0879 


332 


acetoin dehydrogenase, thymine PPi dependent, E1 
component, beta subunit 


ORF00995 


SAG0880 


462 


acetoin dehydrogenase, thymine PPi dependent, E2 
component, dihydrolipoamide acetyltransferase 


ORF00996 


SAG0881 


585 


acetoin dehydrogenase, thymine PPi dependent, E3 
component, dihydrolipoamide dehydrogenase 


ORF00997 


SAG0882 


329 


lipoate-protein ligase A 


ORF00998 


SAG0883 


261 


cobyric acid synthase, putative 
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ORF00999 


SAG0884 


447 


mur ligase family protein 


ORF01000 


SAG0885 


283 


conserved hypothetical protein TIGR00159 


ORF01001 


SAG0886 


319 


Gram-positive signal peptide, YSIRK family domain 
protein 


ORF01002 


SAG0887 


450 


phosphoglucomutase/phosphomannomutase family 
protein 


ORF01003 


SAG0888 


123 


conserved hypothetical protein 


ORF01004 


SAG0889 


126 


conserved hypothetical protein 


ORF01005 


SAG0890 


376 


oxygen-independent coproporphyrinogen III oxidase, 
putative 


ORF01006 


SAG0891 


245 


conserved hypothetical protein 


ORF01007 


SAG0892 


256 


hydrolase, haloacid dehalogenase-like family 


ORF01008 


SAG0893 


218 


conserved hypothetical protein 


ORF01009 


SAG0894 


1370 


conserved hypothetical protein 


ORF01010 


SAG0895 


289 


lipoyl-binding domain protein 


ORF01011 


SAG0896 


108 


oxidoreductase, putative 


ORF01012 


SAG0897 


221 


conserved hypothetical protein j 


ORF01013 


SAG0898 


83 


hypothetical protein 


ORF01014 


SAG0899 


57 


hypothetical protein 


ORF01015 


SAG0900 


56 


hypothetical protein 


ORF01016 


SAG0901 


127 


hypothetical protein 


ORF01018 


SAG0902 


45 


hypothetical protein 


ORF01019 


SAG0903 


44 


hypothetical protein 


ORF01021 


SAG0904 


56 


hypothetical protein 


ORF01022 


SAG0905 


138 


nucleoside diphosphate kinase 


ORF01023 


SAG0906 


610 


GTP-binding protein LepA 


ORF01024 


SAG0907 


877 


streptococcal histidine triad family protein 


ORF01025 


SAG0908 


203 


HD domain protein 


ORF01026 


SAG0909 


154 


acetyltransferase, GNAT family 


ORF01027 


SAG0910 


144 


PilB-related protein 


ORF01030 


SAG091 1 


930 


cation-transporting ATPase, E1-E2 family 


ORF01031 


SAG0912 


367 


nucleoside diphosphate kinase domain protein 


ORF01032 


SAG0913 


212 


chloramphenicol acetyltransferase 


ORF01033 


SAG0914 


203 


conserved hypothetical protein 


ORF01034 


SAG0915 


405 


Tn916, transposase | 


ORF01035 


SAG0916 


67 


Tn916, excisionase 


ORF01037 


SAG0918 


76 


Tn916, hypothetical protein 


ORF01038 


SAG0919 


157 


Tn916, hypothetical protein 


ORF01039 


SAG0921 


117 


Tn916, transcriptional regulator, putative 


ORF01040 


SAG0923 


639 


Tn916, tetracycline resistance protein 


ORF01041 


SAG0925 


310 


Tn916, hypothetical protein 


ORF01042 


SAG0926 


333 


Tn916, NLP/P60 family protein 


ORF01044 


SAG0927 


725 


Tn916, hypothetical protein FRAMESHIFT 


ORF01047 


SAG0928 




Tn916, hypothetical protein FRAMESHIFT 


ORF01048 


SAG0929 


168 


Tn916, hypothetical protein 


ORF01049 


SAG0930 


165 


Tn916, hypothetical protein 


ORF01050 


SAG0931 


73 


Tn916, hypothetical protein 


ORF01051 


SAG0932 


401 


Tn916, transcriptional regulator, putative 


ORF01052 


SAG0933 


461 


Tn916, FtsK/Spol!IE family protein 


ORF01053 


SAG0934 


128 


Tn916, hypothetical protein 


ORF01054 


SAG0935 


104 


Tn916 t hypothetical protein 


ORF01056 


SAG0937 




ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF01057 


SAG0938 


122 


transcriptional regulator, GntR family 


ORF01058 


SAG0939 


1034 


DNA polymerase III, alpha subunit 
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| UKP KeT IMO. ( 


O A C^. w w Kin 1 
oAvaXXXX rvCI rMO. j 


aa | An notation 


| UKrU I uoy j 




340 ! 


6-phosphofructokinase j 


j Ur\rU I UDU | 


QAr^OQAI 1 


500 


pyruvate kinase j 


ORF01061 


SAG0942 


185 


signal peptidase I, putative j 


| ORF01062 | 


SAGU94o | 


47 


hypothetical protein | 


j ORF01063 


SAG0944 


604 

I 


giucosamine~fructose-6-phosphate aminotransferase | 
(isomenzing) ! 


1 vJKrU1Uo4 j 




377 


IS1548, transposase I 


j UKrUlUbb j 




109 I 


phnA protein ( 


1 UKrUlUoo j 


oAoUy^ / | 


213 


amino acid ABC transporter, permease protein j 


| OKrOIUoy 1 


oAoUy4o 1 


209 


amino acid ABC transporter, ATP-binding protein 


ORF01070 


SAG0949 


276 


amino acid ABC transporter, amino acid-binding | 
protein 


ORF01072 


SAG0950 


82 


ribosomal protein S20 | 


ORF01073 


SAG0951 | 


306 


pantothenate kinase j 


ORF01074 


SAG0952 | 


196 


conserved hypothetical protein 


ORF01075 


SAG0953 | 


129 


cytidine deaminase j 


ORF01076 


SAG0954 | 


349 


lipoprotein 


ORF01077 | 


SAG0955 | 


511 


sugar ABC transporter, ATP-binding protein 


ORF01078 


SAG0956 


353 


sugar ABC transporter, permease protein, putative 

- 


OKrOIO/y 


QA/^AQC;7 I 

oAouyo / I 


318 


sugar ABC transporter, permease protein, putative 1 


I orfoi nnn 


SAG0958 1 


4oo j 


NAUn oxidase _j 




i SAG0959 1 


oon I 

oZv j 


L-iactate oenyarogenase j 


! UrvrUIUOt I 


! SAG0960 I 


O A (~\ I 

819 { 


UinA gyrase, a suounit 


i ORFOinfl'* 

1 Utxru 1 UOO i 


! SAG0961 ! 


247 j 


sortase ortA | 


I uiAruiuoH 1 j 


SAG0Q62 


137 J 


giyoxyiase Tamiiy proiein j 


1 ORFfi1085 


SAG0963 1 


oZu I 


conssrvea nypoxneiicai proiein | 


1 ORFOIOfifi ! 


! SAG0964 


Of o j 


iNa+/ri + excnanger jamiiy pruioni | 


1 ORF01087 i 


SAG0965 ! 


a 07 I 

I I 


to loo i, transposase uiim 


1 ORF01088 


1 SAG0966 1 


l i zy 


j io loo i , iransposase ui i d j 


1 ORF01089 


! SAG0967 


1 con i 


j oiviK synxnase 


1 ORF01090 


! SAG0968 




1 iranscripuonai reguiaior, vjnirs. idituiy 


II ORF01091 


SAG0969 


444 1 


gid protein j 


ORF01092 


SAG0970 


! 247 


1 acetyltransferase, GNAT tamiiy j 




I onouy r > 


282 


| lipoprotein .putative j 


ORF01 095 


SAG0972 




1 conserved hypothetical protein, FRAMfcSHIFT j 


\ ORF01096 


j SAG0973 


320 


nisin-resistance protein, putative 


j ORF01097 


j SAG0974 


250 


| ABC transporter, ATP-binding protein j 


| ORF01098 


| SAG0975 


651 


j ABC transporter, permease protein, putative ] 


j ORF01099 


j SAG097D 


222 


j DNA-binding response regulator j 


ORF01100 


\ SAG0977 


312 


j sensor histidine kinase j 


j ORF01101 


SAG0978 


356 


jsite-specific recombinase, phage integrase family 


\ ORF01102 


j SAG0979 


553 


ABC transporter, substrate binding protein, putative 


( UKrUi iuo 


j O /AVJ \J\JU\J 


257 


1 conserved hypothetical protein j 


ORF01104 


SAG0981 


228 


SatD 


ORF01106 


SAG0982 


521 


| signal recognition particle protein 


ORF01108 


SAG0983 


110 


1 conserved hypothetical protein 1 


ORF01109 


SAG0984 


437 


1 sensor histidine kinase CiaH 


ORF01110 


SAG0985 


226 


DNA-binding response regulator CiaR 


I ORF01111 


SAG0986 


849 


aminopeptidase N 


ORF01112 


SAG0987 


217 


phosphate transport system regulatory protein PhoU 
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ORF01113 


SAG0988 


252 


phosphate ABC transporter, ATP-binding protein PstB, 
putative 


ORF01114 


SAG0989 


267 


phosphate ABC transporter, ATP-binding protein PstB, 
putative 


ORF01115 


SAG0990 


295 


phosphate ABC transporter, permease protein PstA, 
putative 


ORF01116 


SAG0991 


305 


phosphate ABC transporter, permease protein 


ORF01117 


SAG0992 


286 


phosphate ABC transporter, phosphate-binding protein 


ORF01118 


SAG0993 


436 


NOL1/NOP2/sun family protein 


ORF01119 


SAG0994 


254 


inositol monophosphatase family protein 


ORF01120 


SAG0995 


93 


conserved hypothetical protein 


ORF01121 


SAG0996 


137 


conserved hypothetical protein 


ORF01122 


SAG0997 


310 


macrolide-efflux protein mreA/riboflavin biosynthesis 
protein RibF 


ORF01123 


SAG0998 


294 


tRNA pseudouridine synthase B 


ORF01124 


SAG0999 


143 


acetyltransferase, GNAT family 


ORF01125 


SAG1000 


423 


conserved hypothetical protein 


ORF01126 


SAG1001 


196 


conserved hypothetical protein 


ORF01127 


SAG 1002 


292 


protease, putative 


ORF01128 


SAG 1003 


876 


permease, putative 


ORF01129 


SAG1004 


233 


ABC transporter, ATP-binding protein 


ORF01131 


SAG 1005 


706 


DNA topoisomerase 1 


ORF01132 


SAG1006 


280 


DprA/SMF protein, putative DNA processing factor 


ORF01133 


SAG1007 


342 


iron-compound ABC transporter, iron-compound- 
binding protein 


ORF01134 


SAG1008 


253 


iron compound ABC transporter, ATP-binding protein 


ORF01135 


SAG1009 


324 


iron compound ABC transporter, permease protein 


ORF01136 


SAG1010 


320 


iron compound ABC transporter, permease protein 


ORF01137 


SAG1011 


182 


acetyltransterase, cysc/LacA/LpxA/Nodi_ tamiiy 


ORF01 138 


SAG1012 


253 


ribonuclease nil 


ORF01139 


SAG1013 


283 


vj 1 p-oinding protein 


ORF01 140 


SAG1014 


190 


conserved hypothetical protein 


ORF01 142 


O A /*"» <4 f\A C 

SAG1015 


494 


carbon starvation protein ostA, putative 


ORF01 143 


SAG1016 


244 


response regulator 


ORF01144 


SAG1017 


579 


sensor histidine kinase, putative 


ORF01145 


SAG1018 


40 


hypothetical protein 


ORF01 146 


O A /"» A t\A f\ 

SAG1019 


39 


conserved hypothetical protein, rKAMbonlr l 


/—\r*i r~ /*v a a Act 

ORF01 148 


SAG 1020 


227 


hypothetical protein 


ORF01 149 


SAG 1021 


107 


hypothetical protein 


ORF01150 


SAG 1022 


A TT 

177 


hypothetical protein 


ORF01151 


SAG 1023 


48 


hypothetical protein 


ORF01152 


SAG 1024 


183 


hypothetical protein 


ORF01153 


SAG 1025 


149 


hypothetical protein 


ORF01156 


SAG 1026 




immunogenic secreted protein, DEGENERATE 


ORF01157 


SAG 1027 


84 


conserved hypothetical protein 


ORF01158 


SAG 1028 


196 


hypothetical protein 


ORF01159 


SAG 1029 


101 


hypothetical protein 


ORF01160 


SAG 1030 


304 


conserved hypothetical protein 


ORF01161 


SAG1031 


120 


extracellular protein, putative POINT MUATION 


ORF01162 


SAG1032 


85 


conserved hypothetical protein 


ORF01164 


SAG 1033 


1309 


FtsK/SpolllE family protein 


ORF01166 


SAG 1034 


55 


hypothetical protein 
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ORF01167 


SAG1035 


424 


conserved nypotnetical protein 


ORF01168 


SAG 1036 


80 


conserved hypothetical protein 


ORF01 169 


SAG 1037 


A tr~r 

157 


hypothetical protein 


ORF01 172 


SAG 1038 


A AAO 

1003 


phage intection protein, putative 


ORF01 173 


SAG1 039 


9b 


conserved nypotnetical protein 


ORF01 174 


SAG 1040 


260 


conserved domain protein 


ORF01175 


SAG 1041 


107 


hypothetical protein 


ORF01176 


SAG 1042 


1060 


carDamoyl-phosphate synthase, large suuunit 


ORF01177 


r - * a a r\ a*> 

SAG 1043 


358 


carbamoyl-phosphate synthase, small subunit 


ORF01178 


SAG 1044 


0 r"\ "7 

307 


aspartate carbamoyltransferase 


ORF01179 


SAG 1045 


430 


dihydroorotase, multifunctional complex type 


ORF01180 


SAG 1046 


209 


orotate phosphoribosyltransferase 


ORF01181 


SAG 1047 


233 


orotidine 5 -phosphate decarboxylase 


ORF01182 i 


SAG 1048 


A A r\ 

410 


membrane protein, putative 


ORF01183 j 


SAG 1049 


C A O 

513 


ABC transporter, ATP-binding protein 


ORF01 184 


<— \ a y— v A f\CA 

SAG1 050 i 


1 12 


nbonucleotide reductase, truncation 


/—\r" i^r\A 4 or 

ORF01 185 


oAolOol 


O CO 

35o 


aspartate-semiaidenyae denyarogenase 


ORF01 186 


O A r^A ACO 

SAG1 OoZ 


47 


cell wall surface ancnor tamuy protein, putative 


/>~\ r— * w—f \ A A a "T 

ORF01187 


SAG 1053 


OA 


nypotnetical protein 


ORF01188 


bAo1Uo4 : 


CO -i 

531 


caraioiipin syntnetase 


ORF01 189 


0 a a actc 


000 


tormate—tetranyarotoiate ngase 


/^>r—> r— f\ <4 H AA 

ORF01 190 


oAOIUob 


OOQ 


npoate-protem ngase a 


/^oirrv<< a c\a 

ORF01191 | 


O A /"* -1 AC "7 

oAvjIOoa 




^r-k, — >. u. , jk- Lk> k J Uk. /V ^ Lk. ^k dk AW* 1 k~V k^ X | kv*> 

conserved nypotnetical protein 


ORF01 192 


O A «f AC Q 




yrk ,^-k kyk jtf-k kkm / «— J Lk. k j k-k /— v 4" V~\ At*\j^Z j*k ^-k 1 an At v-k 

conserved nypotnetical protein 


ORF01193 


SAG 1059 


A A A 

no 


glycine cleavage system n protein, putative 


ORF01194 


SAG 1060 


000 


bacterial luciterase tamuy protein 


ORF01195 


O A /~* A f~\0 A 

SAG 1061 


O A A 

399 


^kk. k aa j-J jkkk k»«*k i~ k! • ■ yklaK ^aw ^kv I™"* ft Jl Pk 1 Lk\ i Mki ^ Jj a a0kh ^aa 

oxidoreductase, rMN-Dinding 


ORF01197 


O A A ACO 

SAG 1062 


000 


1 ^*ki 4- rf-k a/kk k«jk.Xyk a k«a 1 1 y*^a ^*a ^<k. ■kk.kw [It | krkv k—jXk X ^k, t fek 

upo ate- protein ngase A tamuy protein 


ORF01198 


O A -4 ACO 
OA0IODO 


000 
z2o 


iiavoprotein-reiated protein 


ORF01199 


SAG 1064 


•1 OA 
loU 


tiavoprotem tamuy protein 


ORF01200 


O a f**A ACC 

SAG1 065 


A aa 
190 


.kk AT*k ■.■jk In ha an, ^kk ar^. Wi^*k X a «k ak, a a X Xf t #^kk 

membrane protein, putative 


/— \ r-* r~ r\A *~\r\A 

i ORF01201 


OA f*^H ACC 

oAGIObo 


Of z. 


L-k jkk. ^>k mykk Lax ak ^Sa 1 * a ^kv ^k. kkjk a |4/k ^kv *k. 

pnospnogiucomutase 


ORF01202 


SAG 1067 


A 7Q 
1 /O 


loobl , transposase urtA 


ORF01203 


SAG1 068 ^ 


277 


1 O d A. Xaaf^kv k«aa ^k. kkk, - k A /\ /~\ aJQ 

ibobi , transposase urto 


ORF01204 


SAG1 069 


cc 

bo 


Laa a a a*kk X Lax. X ■ | n k* ^"*k X>»"v ■ 

nypotnetical protein 


ORF01205 


SAG 1070 


Off 


A. 1 t J»k*a**k Ha /— k aa «av ka t ^k. aat TT*1^J^ Lk. ■ wa a aak ^ka / k—v v"*k #a »k>^ /*. /k w u>a 4v j» a aA 

Abu transporter, a I r-bincmg/permease protein 


/•— y 1—1 r— /"X ,4 OrtO 

ORF01206 


SAG1071 


£-715 

573 


A. 1™ ^ Xkidk aaa. ^k. aaai rf-k ■- t- aa* ~ ^™ f~~ # Lav ■ *«~k a-J a aaWa ^ka / kVkt ^kv iaaM ^k ^a> a k— k ks«kv. ^aa a k^k\ 

ABC transporter, A 1 P-binding/permease protein 


ORF01207 


SAG1 072 


OAA 

200 


^k\^av aa% A ^kv aa M^k. *J Lkv k a «H ^k> X Lk^ aaX ■ a jk 1 a>k. »^X/k a an 

conserved nypotnetical protein 


ORF01208 


OAG1073 


00c 


M i— % yk Ha i y-k. /^J Laa a « aM ^k. X tak y*k, X« a^,<*a 1 %-k KA^A 1 k<*a 

conservea nypotnetical protein 


/— v j— 1 r~r\ a OAA 

ORF01209 


OA P«( Pit A 

oAG10a4 


A A Q 

41 0 


*- | fc^a. v— k L-k ( » /— J ^ /— \ k /-* /■v.nitk. j I 4*1— n rt+A r"y— . /—a >rkv 

oerine nyoroxymemyitransTerase 


ORF01210 


O A A A"7C 
OAblU/O 


4 0.0 
too 


ouao/Yciu// rrov-f/ t wiLr Tamuy protein 


r~\rTi CTA h 04 4 

(JKrOIZI 1 


OAblU/O 


Z/ O 


mouiiication meinyiase, rtemr\ Tamuy 




OAp^ A77 


IRQ 


peptiae cnam release Tacior i 


ORF01213 


O A l~*A A70 

SAG1 078 


-1 OA 

1o9 


X klkk k a j i i i a *— J a i_ i_ » 1 >■ ■ k«ki j . ^ . 1 » y*m, 

thymidine Kinases 


ORF01214 


SAG1079 


60 


4-oxalocrotonate tautomerase 


ORF01215 


SAG1080 


47 


hypothetical protein 


ORF01216 


SAG 1081 


312 


ApbE family protein 


ORF01217 


SAG1082 


200 


conserved hypothetical protein 


ORF01218 


SAG1083 


411 


conserved hypothetical protein 


ORF01219 


SAG1084 


262 


formate/nitrite transporter family protein 


ORF01220 


SAG1 085 


424 


xanthine permease 


ORF01221 


SAG1086 


193 


xanthine phosphoribosyltransferase 


0^01222 


SAG1087 


327 


guanoslne monophosphate reductase 


ORF01223 


SAG1088 


446 


drug resistance transporter, EmrB/QacA family, 
putative 
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ORF01224 


SAG 1089 


230 


conserved hypothetical protein 


ORF01225 


SAG1090 


666 


potassium uptake protein, putative 


ORF01226 


SAG1091 


216 


oxidoreductase, short chain dehydrogenase/reductase 
family, FRAMESHIFT 


ORF01227 


SAG1092 


330 


phosphate acetyltransferase 


ORF01228 


SAG109o 


294 


ribosomal large subunit pseudouridine synthase, RluD 
suDiamuy 


l_Jr\r U 1 ZZy 




97ft 
Z/O 


conservea nypotneticai protein 


UKrU IZoU 


onb i uy O 


111 

ZZo 


o i r pyropnospnoKinase Tamiiy protein 


iJKrUlzol 


oAbiuyo 


1 on 

»yu 


conserved nypotneticai protein 


L/KrUlzoZ 


oAvji uy / 


OZ4 


noose-pnospnate pyropnospnoKinase 


CJKrU 1 zoo 


CAP«inQQ 

oAbiuyo 


Of I 


cysteine desulpnurase 


LJKrU1zo4 


oAoiuyy 


no 


conserved nypotneticai protein 


ORF01235 


SAG1100 


210 


DNA-binding protein 


ORF01236 


SAG1101 


**** J^*V 

226 


DNA repair protein RadC 


ORF01237 


SAG1102 


377 


membrane protein, putative 


ORF01238 


SAG 11 03 


478 


6-phospho-beta-glucosidase 


ORF01239 


SAG1104 


204 


platelet activating factor, putative 


ORF01240 


SAG1105 


273 


hydrolase, haloacid dehalogenase-like family 


ORF01241 


SAG1106 


309 


transcriptional regulator, AraC family, putative 


ORF01242 


SAG 11 07 


510 


voltage-gated chloride channel family protein 


ORF01243 


SAG1108 


357 


spermidine/putrescine ABC transporter, 
spermidine/putrescine-binding protein 


ORF01244 


SAG1109 


258 


spermidine/putrescine ABC transporter, permease 
protein 


ORF01245 


SAG1110 


264 


spermidine/putrescine ABC transporter, permease 
protein 


ORF01246 


SAG1111 


384 


spermidine/putrescine ABC transporter, ATP-binding 
protein 


ORF01247 


SAG1112 


300 


UDP-N-acetylenolpyruvoylglucosamine reductase 


ORF01248 


SAG1113 


162 


2-amino-4-hydroxy-6-hydroxymethyldihydroptendine 
pyrophosphokinase 


Ur\rU IZ*fy 


O/Ao 1114 


19H 
I ZU 


□inyuroneopterin aiooiase 


Ur\rU IZOU 


o a r^i 1 1 ^ 


9ft7 i 
ZOf 


ainyaropieroate syninase 


V«/r\rU IZO I 


Onvj I I I O 


1ft7 


o i jt cycionyaroiase i 


npcni 9^9 

Ur\rU IZOZ 


OAfi-l 1-17 
OMo III/ 


4ZU 


Toiyipoiygiutamate syninase 


WrxiU I ZOO 


Ortvj I I I O 


9Q^ 

zyo 


raru proxein 


UrxrU 1 ZO*f 


QA01 1 1Q 


9ftft 
Zoo 


nomosenne Kinase 


odpoi 9*^ 
vJrvru i zoo 


oMb I I ZU j 


A07 

*+z / 


nomosenne aenyarogenase 


UrxrU I zoo 


Q Aril 191 ! 
OrVJJ I I Z I 


90A 

zyo 


poiysaccnanue aeacetyiase Tamiiy protein 


UrxrU IZO/ 


Q A r^1 1 99 
Onb I I ZZ 


^l f\ 
O 1 o 


iransponer, dl»u f Tamiiy protein 


nppni 9^ ft 

URrU i ZOO 


QA^I 1 OO 
Onb I I Zo 


*XA 


nypoinencai protein 


vjKrLn zoy 


oAbl i Z4 


40O 


aioenyde denyarogenase tamiiy protein 


ORF01260 


SAG 1125 


f\ f\ I™ 

335 


membrane protein 


/-\Dirni ofti 
Ur\ru i ZD J 


O A<^i1 1 9ft 

oAbl izo 


99ft 
ZZO 


conserved nypotneticai protein 


ORF01262 


SAG1127 


113 


conserved hypothetical protein, FRAMESHIFT 


ORF01263 




187 


hypothetical protein 


ORF01264 


SAG1128 


65 


transcriptional regulator, Cro/CI family 


ORF01265 


SAG 11 29 


36 


hypothetical protein 


ORF01266 


SAG1130 


49 


hypothetical protein 


ORF01268 


SAG1131 


164 


thiol peroxidase 


ORF01269 


SAQ1132 


219 


conserved hypothetical protein 


ORF01272 


SAG1133 


254 


conserved hypothetical protein 
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ORF01273 


SAG 11 34 


213 


transcriptional regulator, GntR family/potassioum 

tat a ■ i ■ ■ * f *| 

uptake protein, TrkA family 


ORF01274 


SAG 11 35 


183 


gls24 protein, putative 


ORF01275 


SAG 11 36 




ft! 1 1 _ - J. * _ 1 . I ar~+ F" X A. A m t m^ | fa | ■ ■ ■ ■ 

conserved hypothetical protein FRAMESH1FT 


ORF01 276 


SAG 11 37 


180 


gls24 protein, putative 


ORF01277 


SAG1138 j 


64 


conserved hypothetical protein 


ORF01 279 


SAG1139 


193 


conserved hypothetical protein 


ORF01280 


SAG 1140 


82 


conserved hypothetical protein 


ORF01281 


SAG1141 


112 


conserved hypothetical protein 


ORF01282 


SAG1142 


759 


ATP-dependent DNA helicase PcrA 


ORF01283 


SAG 11 43 


100 


conserved hypothetical protein, FRAMES HI FT 


ORF01284 


SAG 11 44 


441 


uracil permease 


ORF01285 


SAG 1145 


448 


sodium:alanine symporter family protein 


ORF01286 


SAG 11 46 


411 


cation efflux family protein 


ORF01287 


SAG 11 47 


130 


conserved hypothetical protein 


ORF01288 


SAG1148 


231 


membrane protein, putative 


ORF01289 


SAG1149 


207 


conserved hypothetical protein 


ORF01290 


SAG1150 


400 


nbosomal protein S1 


ORF01291 


SAG1151 


76 


conserved hypothetical protein 


ORF01292 


SAG 11 52 


340 


branched-chain amino acid aminotransferase ; 


ORF01294 


A M% _^ A MM ^% 

SAG1153 


819 


DNA topoisomerase IV, A subunit 


ORF01295 


SAG 11 54 


/—\ MM f% 

653 


DNA topoisomerase IV, B subunit 


ORF01296 


SAG1155 


207 


conserved hypothetical protein TIGR00023 


ORF01297 


SAG 11 56 


217 


uracil-DNA glycosylase 


ORF01298 


SAG1157 


161 


conserved hypothetical protein 


ORF01299 


SAG1158 


413 


CMP-N-acetylneuraminic acid synthetase NeuA 


ORF01300 


SAG 11 59 


209 


neuD protein 


^■fc. pM MM ^M ^ ,Mfc M J 

ORF01301 


SAG1160 


384 


UDP-N-acetylglucosamine-2-epimerase NeuC 


ORF01302 


SAG1161 


341 


N-acetyl neuramic acid synthetase NeuB 


ORF01303 


SAG1162 


466 


cpsL protein ! 


ORF01304 


SAG1163 


318 


cpsVK protein 


ORF01305 


SAG1164 


321 


cpsVJ protein 


ORF01306 


SAG1165 


327 


cpsVO protein 


ORF01307 


SAG1166 


295 


cpsVN protein 


ORF01308 


SAG1167 


241 


cpsVM protein 


ORF01309 


SAG 1168 


364 


cpsVH protein 


ORF01310 


SAG 11 69 


163 


CpsVG 


ORF01311 


SAG1170 


149 


CpsF 


ORF01312 


SAG1171 


462 


CpsE 


ORF01313 


SAG1172 


229 


CpsD protein 


ORF01314 


SAG 11 73 


230 


cpsC protein 


ORF01315 


SAG1174 


243 


capsular polysaccharide biosynthesis protein CpsB 


ORF01316 


SAG1175 


485 


capsular polysaccharide biosynthesis protein CpsA 


ORF01317 


SAG 1176 


290 


capsular polysaccharide synthesis operon 
transcriptional regulator CpsY 


ORF01318 


SAG1177 


255 


cpslaS protein 


ORF01319 


SAG 11 78 


236 


purine nucleoside phosphorylase 


ORF01320 


SAG 11 79 


418 


voltage-gated chloride channel family protein, putative 


ORF01321 


SAG1180 


269 


purine nucleoside phosphorylase 


ORF01322 


SAG1181 


135 


arsenate reductase 


6rP01323 


SAG 1182 


403 


phosphopentomutase 


ORF01324 


SAG 1183 


223 


ribose 5-phosphate isomerase 
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ORF01326 


SAG 11 84 


236 


conserved hypothetical protein 


ORF01327 


SAG1185 


262 


tributyrin esterase 


ORF01328 


SAG1186 


553 


metal lo-beta-lactamase superfamily protein 


ORF01329 


SAG1187 


253 


ABC transporter, ATP-binding protein 


ORF01330 


SAG1188 


287 


ABC transporter, permease protein 


ORF01331 


SAG1189 


334 


conserved hypothetical protein 


ORF01332 


SAG1190 


551 


adherence and virulence protein A 


ORF01333 


SAG1191 


239 


alpha-acetolactate decarboxylase 


ORF01334 


SAG1192 


560 


acetolactate synthase, catabolic 


ORF01335 


SAG1193 


408 


TPR domain protein 


ORF01336 


SAG 11 94 


396 


membrane protein 


ORF01337 


SAG 11 95 


153 


MutT/nudix family protein 


ORF01338 


SAG1196 


160 


mutator MutT protein 


ORF01339 


SAG 11 97 


1072 


hyaiuronidase 


ORF01340 


SAG 11 98 


348 


dTDP-glucose 4,6-dehydratase 


ORF01341 , 


SAG1199 


197 


dTDP-4-dehydrorhamnose 3,5-epimerase 


ORF01342 


SAG 1200 


289 


glucose- 1 -phosphate thy midyly [transferase 


ORF01343 


SAG 1201 


367 


iminodiacetate oxidase, putative 


ORF01344 


SAG 1202 


262 


conserved hypothetical protein TIGR00486 


ORF01345 


SAG 1203 


227 


conserved hypothetical protein 


ORF01346 


SAG 1204 


226 


DNA replication protein Dnad, putative 


ORF01347 


SAG 1205 


172 


adenine phosphoribosyltransferase 


ORF01348 


SAG 1206 


854 


conserved domain protein 


ORF01349 


SAG 1207 


32 


hypothetical protein 


ORF01350 


SAG 1208 


732 


single-stranded- DNA-specific exonuclease RecJ 


ORF01351 


SAG 1209 


253 


oxidoreductase, short chain dehydrogenase/reductase 
family 


ORF01352 


SAG 1210 


309 


metallo-beta-lactamase superfamily protein 


ORF01353 


SAG 1211 


215 


conserved hypothetical protein 


ORF01354 


SAG1212 


412 


GTP-binding protein HfIX 


ORF01355 


SAG 121 3 


296 


tRNA delta(2)-isopentenylpyrophosphate transferase 


ORF01356 


SAG1214 


58 


hypothetical protein 


ORF01357 


SAG 121 5 


305 


exfoliative toxin A, putative 


ORF01358 


SAG1216 


1252 


pullulanase, putative 


ORF01361 


SAG 121 7 




conserved hypothetical protein, FRAMESHIFT 


ORF01362 


SAG 121 8 


194 


conserved hypothetical protein 


ORF01363 


SAG 121 9 


468 


peptidase, M20/M25/M40 family 


ORF01364 


SAG 1220 


200 


nitroreductase family protein 


ORF01365 


SAG1221 




glycerophosphoryl diester phosphodiesterase, 
putative, POINT MUTATION 


ORF01367 


SAG 1222 


593 


excinuclease ABC, C subunit 


ORF01368 


SAG 1223 


255 


conserved hypothetical protein 


ORF01369 


SAG 1224 


446 


MATE efflux family protein 


ORF01370 


SAG 1225 


136 


conserved hypothetical protein 


ORF01371 


SAG 1226 


165 


conserved hypothetical protein 


ORF01372 


SAG1227 


198 


conserved hypothetical protein 


ORF01373 


SAG 1228 


96 


!SSdy1 , transposase OrfA 


ORF01374 


SAG1229 


259 


ISSdyl , transposase OrfB 


ORF01375 


SAG1230 


96 


conserved hypothetical protein 


ORF01377 


SAG1231 




transposase OrfB, IS3 family, degenerate 
FRAMESHIFT 


ORF01379 


SAG 1232 


77 


transposase OrfB, IS3 family, truncation 


ORF01380 


SAG 1233 


822 


streptococcal histidine triad family protein 


ORF01381 


SAG1234 


306 


laminin-binding surface protein 
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UKr K6T IMO. 


SAGxxxx Ret No. 


aa 


Annotation 




OAPi ooe 


A OC 

425 


GBoii, group u intron, maturase 


UKrU I OOO 






coa peptidase precursor rK/\MtSrllFT 


UKrU 1 00*r 


oAb I £o / 


AAA 

444 


nypotneticai proxein 


UKrU l OOO 


oAbl 200 


OAO 

2U2 


nypotneucai proxein 


VJKrUT OOO 


oA(jlZoa 


fO 


conserved nypotneticai protein 


UKrUIOoY 


OAG124U 


A OC 

125 


conserved nypotneticai protein, truncation 


Anrrt h QQQ 

UKrUIOoo 


O A #"» 4 Ovl H 

OAG1241 


78 


transposase urtA, loo tamily 


URr01oo9 


SAG 1242 


67 


transposase UrfB, IS3 family, truncation 


ORPOioyu 


O A /""* A o An 

SAG 1243 


96 


iSSdyl, transposase OrfA FRAMESHIFT 


ORF01391 


O A A O A A 

SAG 1244 


259 


ISSdyl, transposase OrfB 


ORF01392 


SAG 1245 


38 


hypothetical protein 


ORF01393 


SAG 1246 


389 


hypothetical protein 


ORF01394 


SAG1247 


399 


integrase, phage family 


ORF01395 


SAG 1248 


75 


conserved hypothetical protein 


ORF01396 


SAG1249 


74 


transcriptional regulator, Cro/CI family 


ORF01397 


SAG 1250 


621 


Tn5252, relaxase 


ORF01398 


SAG1251 


121 


Tn5252, Orf 9 protein 


ORF01399 


SAG 1252 


120 


Tn5252, Orf 10 protein 


ORF01401 


SAG 1253 


435 


transposase, ISL3 family 


ORF01403 


SAG 1254 


546 


mercuric reductase 


ORF01404 


SAG 1255 


130 


mercuric resistance operon regulatory protein MerR 


ADCAH ^ AC 

URF014UO 


SAG12ob 


142 


IS861, transposase OrfB, truncation 


URF01407 


SAG 125 7 


*7AA 

709 


cation-transporting ATPase, E1-E2 family 


URF01408 


O A -4 OC O 

SAG 1258 


,4 OO 

122 


cadmium efflux system accessory protein 


ADCrtH v< O A 

ORr 01409 


SAG 1259 


99 


conserved hypothetical protein 


UKF0141U 


oAolzbO 


OOO 

262 


hypothetical protein 


UKrU141 i 


O A/"*»H OCM 

oAG12b1 


19o 


conserved nypotneticai protein 


UKFU1412 


SAG 1202 


CAE 

byo 


cation-transporting A l Pase, bl-c2 family 


UKrU1414 






conserved domain protein, rKAMbonlr l 


AQCn A A AC 

UKr0141O 


SAG12b4 


A A Q 

14o 


transcriptional repressor CopY, putative 


nnrn A A A C 

UKrU141b 




2UO 


cadmium resistance transporter, putative 


UKrU 141 * 


oA(j 1 2ob 


•ICO 


nypotneticai protein 


r^Dcrrn aao 
OKh0141o 


SAG 1 2b / 


A AO 

lUo 


hypothetical protein 


UKrU1419 * 


oAbubo 


OO A 

2o0 


ktJBk Bk aB dlk. ^b. Bat J_~l_ M#k. A b BOTk. i ■_ ■ ■ JL ^k. ^ 2 k B< A 

repressor protein, putative 


UKrU142U 


O A f"* -1 OCO 

oAGi2b9 


A A 

44 


nypotneticai protein 


UKrU1421 


oAb 1 2 f U 


4/1 


impD/Muct>/oamt5 tamily protein 


ADCA'l ^1 OQ 

UKrU1420 


OAP-1 07<1 

oAou/1 


a a a 

no 


conserved nypotneticai protein 


UKr 01 424 


O A/"*M 070 


1U2 


u"k aBki m * ^k. jbJ Lbbb, >u\ «V Lk hX b <b#b I b*S> M/sX J|S B Wk 

conserved nypotneticai protein 


UKF0142O 




A A Q 

llo 


conserved nypotneticai protein 


URF0142b 




A OA 

129 


conserved hypothetical protein 


URF01427 


SAG 1275 


7C 

75 


hypothetical protein 


ORF01428 


SAG 127b 


OCO 

o5o 


conserved hypothetical protein 


ORF01430 


SAG 1277 


163 


hypothetical protein | 


ORF01431 


SAG 1278 


96 


hypothetical protein 


Bkkk Eka*. #BkB Bkk a B ^k. 

ORF01432 


SAG 1279 


99 


conserved domain protein 


ORF01433 


C*« A A OO O 

SAG 1280 


2274 


Helicases conserved C-terminal domain protein 


ORF01434 


SAG1281 


183 


hypothetical protein 


ORF01435 


SAG 1282 


63 


lipoprotein, putative 


ORF01436 


SAG1283 


1631 


ceii wall surface anchor family protein 


ORF01437 


SAG 1284 


196 


abortive infection protein AbiGI 


ORF01438 


SAG1285 


281 


abortive infection protein AbiGI 1 


ORF01439 


SAG 1286 


933 


conserved hypothetical protein 


ORF01440 


SAG1287 


776 


conserved hypothetical protein 


ORF01441 


SAG1288 


117 


conserved hypothetical protein, DEGENERATE 
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ORF Ref No. 


SAGxxxx Ref No. | 


aa |i 


annotation [ 


ORF01442 


SAG 1289 


284 


conserved hypothetical protein | 


ORF01443 


SAG 1290 | 


80 j 


hypothetical protein s 


ORF01444 


SAG1291 | 


605 j 


Tn5252, On zi protein, internal aeietion | 


ORF01445 


SAG1292 | 


162 | 


hypothetical protein 


ORF01446 


SAG 1293 j 


194 | 


protease, putative J 


ORF01447 


SAG 1294 j 


77 | 


conserved hypothetical protein j 


ORF01449 


SAG 1295 


127 j 


conserved hypothetical protein j 


ORF01450 


SAG 1296 


142 


conserved hypothetical protein j 


ORF01451 


SAG1297 


451 


type II DNA modification methyltransferase Spn5252IP 1 


ORF01452 


SAG1298 


31 


hypothetical protein 1 


ORF01453 


SAG1299 


272 


conserved hypothetical protein j 


ORF01454 


SAG1300 


57 


conserved hypothetical protein 


ORF01455 


SAG1301 


121 


nbosomal protein L7/L12 | 


ORF01456 


SAG1302 


166 


ribosomal protein L1 0 j 


ORF01458 


SAG 1303 


702 


ATP-dependent Clp protease, ATP-binding subunit I 


ORF01459 


SAG 1304 | 


32 j 


hypothetical protein j 


ORF01460 


SAG1305 


314 


homocysteine S-methyltransferase MmuM, putative 


ORF01461 


SAG1306 I 


458 | 


amino acid permease j 


ORF01463 


SAG 1307 I 


216 


hypothetical protein 


| ORF01464 


SAG1308 


167 


nypotnetical protein 


ORF01465 


SAG 1309 | 


30 


hypothetical protein j 


ORF01466 


SAG1310 


182 


transcriptional regulator, letKTamiiy 


ORF01467 


SAG1311 


198 


GTP-Dinding protein | 


ORF01468 


SAG1312 


408 


ATP-dependent oip protease, a i r'-Dinaing suuunit i 


OKrU14oy 




OD 




OKP0 1470 




I OH 


Hih\/rlro*folatA r^Hi ir*taQ^ f 
uti iy vJi \j\ uiciiw icuui^iciou i 


ORFQ1471 


OAvjIoI O 


z/ y 


if iyi i iiuyiait? oyiiuicioc; | 


ORF01472 


oAol oto 


I oyu 


WMf5.PnA cvnthacf* 1 
| rllvlw"V-»Ur\ oyi HI laoc ^ 


ORF01473 




*tZ / 


I *a\ h\/Hrrvv\/-.^-mc»th\7lrih itarvUf^nA rprinctasp 1 
| O f lyLlI UAy o 1 1 iti Li iy iy i Li lctji y i uyn i cuuuiaoc | 


ORF01474 






| coiiocrvfcjij i lypuii itjuuai pujitsui i 


ORF01475 


SAo1*3iy 


lof 


j nemoiysm in, puiauvc s 


ORF01476 




I on/ 

j 


1 />r«ncan/£iH hwnnthotir-al nrntf^in Tl f^ROt ") i 4 / ffi 
| trOiloclVcU 1 lypUll ItSUOcll jJIULClll i 1 V_J 1\UU i*t/ | 


ORF01477 




j Zo4 


1 nli itafhinno Q-tranQfiaraQp famtlv nrntpin I 

| ylUlclU tlUIlc O-ll al lolcl aoc leal I Illy piuiciu | 


ORF01478 




I 


t AAncan/aH HrMTiiain nrntAin 1 
| LfUI loci VtJli UUIlldlf 1 | 


ORF01479 


oAo i oZo 


| OO I 


1 icr\n(3nton\/LHinhnc;rjhptp Hplta-i^nmprase 1 

i loUiJtJI IIC l iy I UILJI laic uciia ioui i luiaos 


ORF01480 


oAol oZ4 


1 **^n 


1 r\hrtcnh>nmo\/cjlr>n otp kina^iP 1 
1 Ul ItJoUl \vJl 1 ItJvalUI Ictlv? Mliaoc I 


ORF01481 


O A f** H OOC 


1 «3 14 


| Qipi lOopi lUI I JfcJVaiUI laLts UcliaiuuAyiaoc [ 


ORF01482 


oAvjl oZo 


1 QQO 

1 zyz 


1 nic VdlUI idle Mlldot?, [JUlallvC [ 


ORF01483 


O A p<007 

SACjI oZ/" 


4uy 


i ssnsor nisiiQine KindotJ | 


ORF01484 


SAG 1328 


f ZZo 


j uiNA-Dinaing response regulator i 


ORF01485 


SAG1 329 


I 4f~\ f\ f\ 

1 208 


| GTP pyrophosphokinase family protein j 


ORF01486 


SAG 1330 


68 


It all* 11*^ 1 

j hypothetical protein 


ORF01488 


bAblooT 


1 Q7Q 

y / y 


1 nrAtoin 

i r\0 pruiein j 


ORF01489 


SAG1332 


146 


transcriptional regulator, MarR family, putative 


ORF01490 


SAG1333 


690 


j 5'-nucleotidase family protein [ 


ORF01491 


SAG1334 


136 


| polypeptide deformylase, putative | 


ORF01492 


SAG 1335 


449 


1 NADP-specific glutamate dehydrogenase j 


ORF01494 


SAG1336 


169 


1 conserved hypothetical protein 


ORF01495 


SAG1337 


589 


ABC transporter, ATP-binding/permease protein | 


ORF01496 


SAG1338 


579 


j ABC transporter, ATP-binding/permease protein 


ORF01497 


SAG1339 


j 157 


I acetyltransferase, GNAT family 
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aa I 


A * A » 

Annotation 


ORF01498 


SAG 1340 


622 j 


ABC transporter, Al K-Dinaing protein 


ORF01499 


SAG1341 | 


402 j 


poIyA polymerase Tamiiy protein 


ORF01500 


SAG 1342 j 


282 j 


DegV family protein 


ORF01501 


SAG 1343 j 


126 j 


conserved nypotneticai protein j 


ORF01502 


At***. A A*—*. A JT\ »t At 1 

SAG1344 | 


177 


hypothetical protein 


ORF01503 


SAG1345 


164 


conserved hypothetical protein 


ORF01504 


SAG1346 


641 


PTS system, fructose specific li ABC components 


ORF01505 


SAG1347 


303 


1 -phosphofructokinase 


ORF01506 


SAG1348 


247 


lactose phosphotransferase system repressor 


ORF01507 


SAG1349 


411 


beta-lactam resistance factor 


ORF01508 


SAG1350 


544 


surface antigen-related protein 


ORF01509 


SAG1351 


307 


2-dehydropantoate 2-reductase, putative 


ORF01510 


SAG1352 


356 


regulatory protein, putative 


ORF01511 


SAG1353 


330 


pyridine nucleotide-disulphide oxidoreductase family 
protein 


ORF01512 


SAG 1354 


251 


tRNA (guanine-N 1 )-methyltransferase 


ORF01513 


SAG 1355 


172 


16S rRNA processing protein RimM 


ORF01515 


SAG1356 


503 


transcriptional regulator, RofA family 


ORF01516 


SAG1357 


80 


KH domain protein 


ORF01517 


SAG1 358 


90 


ribosomal protein S16 


ORF01518 


SAG 1359 


415 


permease, putative 


ORF01519 


SAG1360 


236 


ABC transporter, ATP-binding protein 


ORF01520 


SAG1361 


414 


conserved hypothetical protein 


ORF01522 


SAG 1362 


532 


carbamoyl-phosphate synthase, large subunit, putative 


ORF01523 


SAG1363 


356 


carbamoyl-phosphate synthase, small subunit 


ORF01524 


SAG1364 


173 


pyrimidine operon regulatory protein 


ORF01525 


SAG1 365 


296 | 


ribosomal large subunit pseudoundine synthase, Rluu 
subfamily 


ORF01526 


SAG 1366 


154 


lipoprotein signal peptiaase 


ORF01527 


SAG 1367 


301 


transcriptional regulator, LysK Tamiiy 


ORF01528 


SAG 1368 


94 


ribosomal protein lz/ 


ORF01529 


SAG1369 


112 


conserved hypothetical protein 


ORF01530 


SAG1370 


A f\ A 

\ 104 


j ribosomal protein Lzi 


r ORF01531 


t~*A a ^ — T ,4 

SAG1371 


I 392 


| conserved nypotneticai protein 


| ORF01532 


; SAG 1372 


i A f\A 

\ 404 


| thiamine oiosyntnesis protein inn 


| ORF01533 


SAG 1373 


1 O OA 

1 OOl 


1 cysteine oesuipnurase 


ORF01535 


SAG 1374 


i 150 


| cons6rv6u nypoineucai proiein 


j ORF01536 


SAG1375 


t A ACi 

j 44y 


1 giutatnione reuuciase 


j ORF01537 


SAG 1376 


t AAA 
| 111 


1 conserveo nypotneuudi piuimn 


j ORF01538 


SAG 1377 


I OOO 


| cnonsmate synindoe 


ORF01539 


O A H O TO 

SAG 1378 


! OOO 


| o-oenyaroquinaie synindbt? 


ORF01540 


SAG 1379 


) OOK 
| ZZO 


1 cs-oenyoroquinaie uenyaraicisse 


ORF01541 


SAG 1380 


I o o c 

I 38o 


1 conserved nypotneticai protein 


ORF01542 


SAG1381 


714 


suifatase 


| ORF01543 


oAo 1 ooz 


I lit? 


1 rihnQrMTjal nrntpin 1 Pf) 
I I iuuoui i Idi \Ji u icii i i — cyj 


ORF01544 


SAG1383 


66 


j ribosomal protein L35 ! 


ORF01545 


SAG1384 


176 


translation initiation factor IF-3 


ORF01546 


SAG1385 


227 


| cytidylate kinase 


ORF01547 


SAG1386 


174 


j conserved hypothetical protein 


ORF01548 


SAG1387 


65 


1 ferredoxin, 4Fe-4S 


ORF01549 


SAG1388 


163 


j conserved hypothetical protein 


ORF01550 


SAG1389 


406 


| peptidase t 


ORF01551 


SAG 1390 


I 544 


1 polysaccharide biosynthesis protein, putative 
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ORF01552 


SAG1391 


484 


UDP-N-acetylmuramoyialanyl-D-glutamate--2,6- 
diaminopimelate ligase 


ORF01553 


SAG 1392 


264 


iron compound ABC transporter, ATP-binding protein 


ORF01554 


SAG1393 


310 


iron compound ABC transporter, substrate-binding 
protein 


ORF01555 


SAG 1394 


341 


iron compound ABC transporter, permease protein 


ORF01556 


SAG 1395 


333 


iron compound ABC transporter, permease protein 


ORF01557 


SAG 1396 


217 


conserved hypothetical protein 


ORF01558 


SAG 1397 


311 


inorganic pyrophosphatase, manganese-dependent 


ORF01559 


SAG1398 


262 


pyruvate formate-lyase-activating enzyme 


ORF01560 


SAG1399 


444 


CBS domain protein 


ORF01561 


SAG1400 


188 


It III* V r * 

conserved hypothetical protein 


ORF01563 


SAG1401 


311 


conserved hypothetical protein TIGR01212 


ORF01564 


SAG1402 


213 


PAP2 family protein 


ORF01565 


SAG1403 


194 


membrane protein, putative 


ORF01566 


SAG1404 


308 


celi wall surface anchor family protein 


ORF01567 


SAG1405 


294 


sortase family protein 


ORF01568 


SAG1406 


293 


sortase family protein 


ORF01569 


SAG1407 


705 


cell wall surface anchor family protein 


ORF01570 


SAG1408 


901 


cell wall surface anchor family protein 


ORF01571 


SAG1409 


326 


transcriptional regulator, RofA family FRAMESHIFT 


ORF01572 


SAG1410 


379 


glycosyl transferase, group 1 family protein 


ORF01573 


SAG1411 


282 


exopolysaccharide biosynthesis protein, putative 


ORF01574 


SAG1412 


474 


exopolysaccharide biosynthesis protein, putative 


ORF01575 


SAG1413 


454 


hypothetical protein 


ORF01 576 


SAG1414 


308 


glycosyl transferase, group 2 family protein 


ORF01 577 


SAG1415 


311 


glycosyl transferase, group 2 family protein 


ORF01578 


SAG1416 


352 


dTDP-glucose 4,6-dehydratase, putative 


ORF01579 


SAG1417 


240 


4-diphosphocytidyl-2C-methyl-D-erythritol synthase, 
putative 


ORF01580 


SAG1418 


259 


licD protein, putative 


ORF01581 


SAG1419 


577 


hypothetical protein 


ORF01582 


SAG1420 


117 


conserved hypothetical protein 


ORF01583 


SAG1421 


243 


glycosyl transferase, group 2 family protein 


ORF01584 


SAG 1422 


313 


glycosyl transferase, group 2 family protein 


ORF01585 


SAG 1423 


384 


conserved hypothetical protein 


ORF01586 


SAG1424 


284 


dTDP-4-dehydrorhamnose reductase 


ORF01587 


SAG1425 


113 


conserved hypothetical protein 


ORF01589 


SAG1426 


369 


RNA polymerase sigma-70 factor 


ORF01590 


SAG1427 


602 


DNA primase 


ORF01591 


SAG1428 


125 


large conductance mechanosensitive channel protein 


ORF01592 


SAG1429 


58 


ribosomal protein S21 


ORF01593 


SAG1430 


167 


conserved hypothetical protein 


ORF01594 


SAG1431 


268 


amino acid ABC transporter, amino acid-binding 
protein 


ORF01596 


SAG1432 


347 


ammonium transporter family protein 


ORF01597 


SAG1433 


375 


conserved hypothetical protein 


ORF01598 


SAG 1434 


328 


rhodanese family protein 


ORF01599 


SAG1435 


101 


conserved hypothetical protein 


ORF01600 


SAG1436 


457 


glyceroi-3-phosphate transporter, putative 
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ORF01601 


SAG1437 


55 


hypothetical protein 


ORF01602 


SAG1438 


754 


glycogen phosphorylase 


ORF01603 


SAG1439 


498 


4-alpha-giucanotransferase 


ORF01604 


SAG1440 


342 


maltose operon repressor MaIR, putative 


ORF01605 


SAG1441 


415 


maltose/maltodextrin ABC transporter, 
maltose/maltodextrin-bindmg protein 


ORF01606 


SAG1442 


456 


maltose ABC transporter, permease protein 


ORF01607 


SAG1443 


278 


maltose ABC transporter, permease protein 


ORF01608 


SAG1444 


490 


proton/peptide symporter family protein 


ORF01610 


SAG1445 




MutT/nudix family protein, FRAMESHIFT 


ORF01611 


SAG1446 


62 


hypothetical protein 


ORF01612 


SAG1447 


441 


conserved hypothetical protein 


ORF01613 


SAG 1448 


502 


glycosyl transferase, group 1 family protein 


ORF01614 


SAG 1449 


795 


preprotein translocase SecA subunit, putative 


ORF01615 


SAG 1450 


330 


conserved domain protein 


ORF01617 


SAG1451 


494 


conserved hypothetical protein 


ORF01618 


SAG1452 


514 


conserved hypothetical protein 


ORF01619 


SAG1453 


409 


preprotein translocase SecY family protein 


ORF01621 


SAG1454 


398 


conserved hypothetical protein 


ORF01622 


SAG 1455 


295 


glycosyl transferase, group 2 family protein 


ORF01623 


SAG 1456 


312 


glycosyl transferase, family 8, degenerate 


ORF01624 


SAG1457 


129 


IS1381, transposase OrfB 


ORF01625 


SAG1458 


127 


IS1381 , transposase OrfA 


ORF01626 


SAG 1459 


413 


glycosyl transferase family 8 


ORF01627 


SAG 1460 


401 


glycosyl transferase, family 8 


ORF01628 


SAG 1461 


335 


conserved hypothetical protein 


ORF01630 


SAG1462 


970 


cell wall surface anchor family protein 


ORF01632 


SAG1463 




transcriptional regulator, RofA family POINT 
MUTATION 


ORF01634 


SAG 1464 


663 


excinuclease ABC, B subunit 


ORF01635 


SAG1465 


306 


protease, putative 


ORF01636 


SAG1466 


727 


glutamine ABC transporter, glutamine-binding 
protein/permease protein, putative 


ORF01637 


SAG 1467 


246 


glutamine ABC transporter, ATP-binding protein, GlnQ 
putative 


ORF01638 


SAG 1468 


116 


conserved hypothetical protein 


ORF01639 


SAG 1469 


52 


conserved hypothetical protein 


ORF01640 


SAG 1470 


437 


GTP-binding protein, GTP1/Obg family 


ORF01641 


SAG1471 


42 


conserved hypothetical protein 


ORF01643 


SAG 1472 


413 


aminopeptidase PepS 


ORF01645 


SAG 1473 


192 


cell wall surface anchor family protein 


ORF01646 


SAG 1474 


680 


amidase family protein 


ORF01647 


SAG 1475 


240 


ribosomal small subunit pseudouridine synthase A 


ORF01648 


SAG 1476 


280 


oxidoreductase, aldo/keto reductase family 


ORF01650 


SAG1477 


224 


nitroreductase family protein 


ORF01651 


SAG 1478 


130 


lactoylglutathione lyase 


ORF01652 


SAG 1479 


308 


glycosyl transferase, group 2 family protein 


ORF01653 


SAG 1480 


462 


amino acid permease 


ORF01654 


SAG1481 


155 


SsrA-binding protein 


ORF01655 


SAG 1482 


801 


exoribonuciease, VacB/Rnb family 


ORF01657 


SAG1483 


78 


preprotein translocase, SecG subunit 


ORF01658 


SAG 1485 


389 


multi-drug resistance protein 


ORF01660 


SAG1486 


548 


hypothetical protein 


ORF01661 


SAG1487 


233 


ABC transporter, ATP binding protein 
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ORF01662 


SAG 1488 


195 


dephospho-CoA kinase 


ORF01663 


SAG 1489 


273 


formamidopyrimidine-DNAglycosylase 


ORF01665 


SAG 1490 


282 


transcriptional regulator, MutR family 


ORF01666 


SAG1491 


530 


hypothetical protein j 


ORF01667 


SAG 1492 


58 


hypothetical protein 


ORF01668 


SAG 1493 




hypothetical protein 


ORF01670 


SAG 1494 


32 


hypothetical protein 


ORF01672 


SAG 1495 


81 


protease, putative, POINT MUTATION 


ORF01673 


SAG 1496 


110 


hypothetical protein 


ORF01674 


SAG 1497 


37 


hypothetical protein 


ORF01675 


SAG 1498 


133 


hypothetical protein 


ORF01677 


SAG 1499 


299 


GTP-binding protein Era 


ORF01678 


SAG 1500 


132 


diacylglycerol kinase 


ORF01679 


SAG1501 


161 


conserved hypothetical protein T1GR00043 


ORF01680 


SAG 1502 


268 


tetracenomycin polyketide synthesis O- 
methyltransferase TcmP, putative 


ORF01681 


SAG 1503 


39 1 


hypothetical protein 


ORF01682 


SAG 1504 


38 


hypothetical protein 


ORF01683 


SAG 1505 


158 


MutT/nudix family protein 


ORF01684 


SAG 1506 


267 


hypothetical protein 


ORF01685 


SAG 1507 


345 


PhoH family protein 


ORF01 686 


SAG1508 


590 


67 kDa Myosin-crossreactive streptococcal antigen 


ORF01687 


SAG 1509 


71 


conserved hypothetical protein 


ORF01688 


SAG1510 


169 


peptide methionine sulfoxide reductase 


ORF01689 


SAG1511 


284 


conserved hypothetical protein 


ORF01690 


SAG1512 


185 


ribosome recycling factor 


ORF01691 


SAG1513 


242 


uridylate kinase 


ORF01692 


SAG1514 


226 


peptide ABC transporter, ATP-binding protein 


ORF01693 


SAG1515 


262 


peptide ABC transporter, ATP-binding protein 


ORF01694 


SAG1516 


255 


peptide ABC transporter, permease protein 


ORF01695 


SAG1517 


314 


peptide ABC transporter, permease protein 


ORF01696 


SAG 151 8 


525 


peptide ABC transporter, peptide-binding protein j 


ORF01697 


SAG1519 


229 


ribosomal protein L1 


ORF01698 


SAG 1520 


141 


ribosomal protein L1 1 


ORF01699 


SAG 1521 


388 


transposase, IS30 family, putative \ 


ORF01700 


SAG 1522 


460 


transporter, major facilitator family 


ORF01702 


SAG 1523 


404 


peptidase, M20/M25/M40 family 


ORF01703 


SAG 1524 


294 


transcriptional regulator, LysR family 


ORF01704 


SAG 1525 


117 


conserved hypothetical protein 


ORF01705 


SAG 1526 


178 


IS861, transposase OrfA 


ORF01706 


SAG 1527 


277 


IS861, transposase OrfB 


ORF01707 


SAG 1528 


571 


chorismate binding enzyme 


ORF01708 


SAG 1529 


785 


FtsK/SpoMIE family protein 


ORF01709 


SAG 1530 


267 


Deotidvl-orolvl cis-trans isomerase cvcloohiiin-tvoe 


ORF01710 


SAG 1531 


277 


manganese ABC transporter, permease protein 


ORF01711 


SAG 1532 


238 


manganese ABC transporter, ATP-binding protein 


ORF01712 


SAG 1533 


308 


manganese ABC transporter, manganese-binding 
adhesion liprotein 


ORF01713 


SAG 1534 


215 


iron-dependent transcriptional regulator 


ORF01714 


SAG 1535 


229 


5-methylthioadenosine nucleosidase/S- 
adenosylhomocystelne nucleosidase 


ORF01715 


SAG 1536 


89 


conserved hypothetical protein 


ORF01716 


SAG 1537 


184 


MutT/nudix family protein 
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ADCn<1 7H O 

UKrUI fio 


(— » a a CO Q 
SAGIOOO 


459 i 


UDr-N-acetyigiucosamme pyropnospnoryiase 


ORFQ1719 


SAG 1539 


O A 

31 


nypotneticai protein 


r~~o a toa 

ORF01720 


SAG 1540 


137 


conserved nypotneticai protein 


ORF01721 


SAG 1541 


or 

125 


glyoxaiase Tamiiy protein 


ORF01722 


SAG 1542 


318 


oxidoreouctase, kjTo/iah/MocA Tamiiy 


ORF01724 


SAG 1543 




conserved nypotneticai protein, FKAMtbHIr I 


ORF01725 


SAG 1544 


232 


gluconate o-aenydrogenase, putative 


ORF01726 


SAG 1545 


78 


conserved hypothetical protein 


ORF01727 


SAG 1546 


82 


conserved hypothetical protein 


ORF01729 


SAG 1547 


166 


acetyltransferase, GNAT family 


ORF01730 


SAG 1548 


422 


glycosyl transferase, group 2 family protein 


ORF01731 


SAG 1549 


127 


IS1381 , transposase OrfA 


ORF01732 


SAG 1550 


129 


IS1381, transposase OrfB 


ORF01733 


SAG 1551 


67 


hypothetical protein 


ORF01734 


SAG 1552 


719 


conserved hypothetical protein 


ORF01735 


SAG 1553 


477 


hypothetical protein 


ORF01736 


SAG 1554 


225 


hypothetical protein 


ORF01737 


SAG 1555 


231 


hypothetical protein 


ORF01738 


SAG1556 


445 


branched-chain amino acid transport system II carrier 








protein 


ORF01739 


SAG1 557 


665 


methionyl-tRNA synthetase 


ORF01740 


SAG 1558 


291 


tellunte resistance protein TehB 


ORF01741 


SAG 1559 


231 


membrane protein, putative 


ORF01 742 


SAG 1560 


40 


hypothetical protein 


ORF01743 


SAG 1561 


405 


PTS system component, putative 


ORF01744 


SAG 1562 


280 


conserved hypothetical protein 


ORF01745 


SAG 1563 


275 


exodeoxyribonuclease 


ORF01 746 


SAG 1564 


118 


conserved hypothetical protein 


ORF01747 


SAG 1565 


158 


methylated-DNA-protein-cysteine S-methyltransferase 


ORF01748 


SAG1566 


393 


D-isomer specific 2-hydroxyacid dehydrogenase family 








protein 


ORF01749 


SAG1567 


A OO 

182 


acetyltransferase, bNA i tamiiy 


Or>r"A «< *-jr o 

ORF01 750 


SAG1 000 




pnospnosenne aminotransferase rr<Aivit:onir i 


ORF01752 


SAG 1569 


O A. A 

21 1 


copper homeostasis protein outo, putative 


ORF01753 


SAG 1570 


f\ A 

34 


conserved hypothetical protein 


ORF01754 


SAG1571 


53 


1 f 1 E » | 1_ ■ 

hypothetical protein 


ORF01755 


SAG1572 


287 


tetrapyrrole methylase family protein 


ORF01756 


SAG1573 


108 


conserved hypothetical protein 


ORF01758 


SAG1574 


287 


DNA polymerase III, delta prime subunit, putative 


ORF01759 


SAG1575 


211 


thymidylate kinase 


ORF01761 


SAG1576 


267 


transposase, IS30 family, putative, truncation 


ORF01763 


SAG1577 


219 


AcuB family protein 


ORF01764 


SAG1578 


236 


branched-chain amino acid ABC transporter, ATP- 








binding protein 


ORF01765 


SAG1579 


254 


branched-chain amino acid ABC transporter, ATP- 








binding protein 


ORF01766 


SAG 1580 


317 


branched-chain amino acid ABC transporter, 








permease protein 


ORF01767 


SAG 1581 


289 


branched-chain amino acid ABC transporter, 








permease protein 


ORF01769 


SAG1582 


388 


branched-chain amino acid ABC transporter, amino 








acid-binding protein 


ORF01770 


SAG 1583 


81 


conserved hypothetical protein 


ORF01772 


SAG 1584 


377 


IS1548, transposase 
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ORF Ref No. 



ORF01773 



SAGxxxx Ref No. 



SAG1585 



aa (Annotation 



1 96 I ATP-dependent Clp protease, proteolytic subunit CIpP 



ORF01774 



ORF01775 



ORF01777 



ORF01778 



ORF01779 



SAG1586 



209 uracil phosphoribosyltransferase 



SAG1587 



389 [aminotransferase, class I 



SAG1588 



182 I RNA methyltransferase, TrmH family, group 2 



SAG1589 



450 amino acid permease, putative 



SAG1590 



449 | potassium uptake protein, Trk family 



ORF01780 



SAG 1591 



475 cation uptake protein, Trk family 



ORF01781 



SAG1592 



83 I conserved hypothetical protein TIGR00278 



ORF01782 



SAG1593 



240 | ribosomal large subunit pseudouridine synthase B 



ORF01783 



ORF01784 



SAG 1594 



194 | conserved hypothetical protein TIGR00281 



SAG 1595 



235 I Uncharacterized ACR, COG1354 



ORF01785 



SAG 1596 



246 | integrase/recombinase, phage integrase family 



ORF01786 



SAG 1597 



1 57 CBS domain protein 



ORF01787 



SAG 1598 



173 | conserved hypothetical protein 



ORF01788 



SAG 1599 



324 HAM1 protein 



ORF01789 



SAG 1600 



264 glutamate racemase 



ORF01790 



SAG1601 



79 I conserved hypothetical protein 



ORF01791 



SAG1 602 



180 I membrane protein, putative 



ORF01792 



ORF01793 



SAG 1603 



173 | transcriptional regulator, biotin repressor family 



SAG 1604 



229 | membrane protein, putative 



ORF01794 



SAG 1605 



167 | conserved hypothetical protein 



ORF01795 



SAG1606 



247 | RNA methyltransferase, TrmH family 



ORF01796 



ORF01797 



SAG 1607 



92 | acylphosphatase 



SAG 1608 



310 I membrane protein, putative 



ORF01799 



SAG 1609 



"221 | amino acid ABC transporter, permease protein 



ORF01800 



SAG1610 



285 I amino acid ABC transporter, substrate-binding protein 



ORF01801 



SAG1611 



486 I amidase family protein 



ORF01802 



SAG1612 



160 I transcription elongation factor GreA 



ORF01803 



SAG1613 



"600 | Uncharacterized BCR, YceG family COG1559, putative 



ORF01804 



SAG1614 



167 | acetyltransferase, GNAT family 



SAG1615 



443 I UDP-N-acetylmuramate-alanineTigase 



ORF01806 



SAG1616 



205 I conserved hypothetical protein 



ORF01808 



SAG1617 



32 | hypothetical protein 



SAG1618 



1032 |Snf2 family protein 



SAG1619 



377 I 1S1548, transposase 



SAG1620 



436 | phosphoglycerate dehydrogenase-related protein 



SAG1621 



300 | primosomal protein Dnal 



SAG1622 



"391 | conserved hypothetical protein 



ORF01814 



SAG 1623 



*159 | conserved hypothetical protein T1GR00244 



SAG 1624 



501 I sensor histidine kinase CsrS 



SAG1625 



229 I DNA-binding response regulator CsrR 



SAG 1626 



177 | conserved hypothetical protein 



ORF01820 



SAG1627 



296 I heat shock protein HtpX 



SAG 1628 



184 lemA protein 



SAG 1629 



237 I glucose-inhibited division protein B 



SAG 1630 



459 | sodium transport family protein 



SAG1631 



223 | potassium uptake protein, Trk family, putative 



SAG 1632 



276 l cobalt transport family protein 



ORF01825 



SAG 1633 



558 I ABC transporter, ATP-binding protein 



ORF01826 



SAG1 634 



212 | conserved hypothetical protein 



ORF01827 



SAG 1635 



"402 | sodiumtdicarboxylate symporter family protein 
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ORF01828 


SAG1636 


455 


branched-chain amino acid transport system II carrier 
protein 


ORF01 829 


SAG 1637 


351 


alcohol dehydrogenase, zinc-containing 


ORF01830 


SAG1638 


230 


ABC transporter, permease protein 


ORF01831 


SAG1639 


356 


ABC transporter, ATP-binding protein 


ORF01832 


SAG 1640 


458 


peptidase, M20/M25/M40 family 


ORF01833 


SAG1641 


274 


lipoprotein, putative 


ORF01834 


SAG 1642 


277 


ABC transporter, substrate-binding protein 


ORF01835 


SAG1643 


229 


glutamine amidotransferase, class I 


ORF01836 


SAG 1644 


37 


hypothetical protein 


ORF01837 


SAG1645 


238 


conserved hypothetical protein TIGR01033 


ORF01838 


SAG 1646 


32 


hypothetical protein 


ORF01839 


SAG1 647 


328 


dihydroxyacetone kinase family protein 


ORF01840 


SAG1648 


178 


transcriptional regulator, TetR family, putative 


ORF01842 


SAG 1649 


37 


hypothetical protein 


ORF01843 


SAG 1650 


329 


dihydroxyacetone kinase family protein 


ORF01844 


SAG1651 


192 


dthydroxyacetone kinase family protein 


ORF01845 


SAG 1652 


124 


hi ft** i ■ ■ 

conserved hypothetical protein 


ORF01846 


SAG 1653 


237 


glycerol uptake facilitator protein 


ORF01847 


SAG 1654 


134 


conserved hypothetical protein 


ORF01848 


SAG1655 


237 


transcnptional regulator, MerR family 


ORF01849 


SAG1656 


369 


conserved hypothetical protein 


ORF01850 


SAG1657 


83 


> All* 1 * « 

hypothetical protein 


ORF01851 


SAG1658 


244 


conserved hypothetical protein 


ORF01852 


SAG 1659 


118 


lojap-related protein 


ORF01853 


SAG 1660 


173 


isochorismatase family protein 1 


ORF01854 


SAG1661 


195 


conserved hypothetical protein TIGR00488 


ORF01855 


SAG1662 


210 


conserved hypothetical protein TIGR00482 


ORF01856 


SAG 1663 


105 


conserved hypothetical protein TIGR00253 


ORF01857 


SAG1664 


372 


GTP-bmding protein 


ORF01858 


SAG 1665 


177 


hydrolase, haloacid dehalogenase-hke family 


ORF01859 


SAG 1666 


295 


membrane protein 


ORF01860 


SAG1 667 


480 


giutamyl-tRNA(Gln) amidotransferase, B subunit 


ORF01861 


SAG1668 


488 


glutamyl-tRNA(Gln) amidotransferase, A subunit 


ORF01862 


SAG1669 


100 


glutamyl-tRNA(Gln) amidotransferase, C subunit 


ORF01863 


SAG 1670 


881 


pyruvate phosphate dikmase 


ORF01864 


SAG1671 


276 


conserved hypothetical protein 


ORF01865 


SAG 1672 


! 170 


CBS domain protein 


ORF01866 


SAG 1673 


377 


3-hydroxyacyl-CoA dehydrogenase family protein 


ORF01867 


SAG 1674 


182 


isochorismatase family protein 


ORF01869 


SAG 1675 


261 


transcriptional regulator CodY, putative 


! ORF01870 


SAG1 676 


403 


aminotransferase, class 1 


ORF01871 


SAG 1677 


137 


universal stress protein family FRAMESHIFT 


ORF01872 


SAG1678 


460 


hydrolase, haloacid dehalogenase-like family 


ORF01873 


SAG1679 


320 


asparaginase family protein 


ORF01874 


SAG 1680 


292 


shikimate 5-dehydrogenase 


ORF01875 


SAG 1681 


304 


oxidoreductase, aldo/keto reductase family 


ORF01876 


SAG1 682 


671 


ATP-dependent DNA helicase RecG 


ORF01877 


SAG1683 


512 


immunogenic secreted protein, putative 


ORF01 878 


SAG1684 


366 


alanine racemase 


ORF01879 


SAG 1685 


119 


hoto-(acyl-carrier-protein) synthase 


ORF01880 


SAG 1686 


335 


phospho-2-dehydro-3-deoxyheptonate aldolase 


ORF01881 


SAG1687 


842 


preprotein translocase, SecA subunit 


ORF01882 


SAG1688 


315 


mannose-6-phosphate isomerase, class 1 


ORF01883 


SAG1689 


293 


fructokinase 
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/— \ r—> tr A «f ooc 

ORFQ1885 


SAG 1690 


£2 O A 

639 


PTS system II ABC components 


UKruloob 


SAG 169 1 


yl TA 

479 


sucrose-6-phosphate hydrolase 


(JKrUl oo7 


o AG lb 92 


o on 

320 


sucrose operon repressor ScrR 


UKrUlooo 


oAG1b9o 


AAA 


N utilization substance protein B 


(JKrUlooy 


O A f^A df\ A 

o AG 1694 


A oo 
129 


conserved hypothetical protein 


ORF01890 


SAG 1695 


186 


translation elongation factor P 


ORF01892 


SAG 1696 


oo 

38 


hypothetical protein 


I ORF01893 


SAG 1697 


48 


hypothetical protein 


ORF01 894 


SAG 1698 


99 


conserved hypothetical protein 


ORF01895 


SAG 1699 


30 


hypothetical protein 


ORF01896 


SAG 1700 


76 


hypothetical protein 


ORF01897 


SAG1701 


56 


hypothetical protein 


ORF01898 


SAG1702 


41 


hypothetical protein 


ORF01899 


SAG1703 


54 


hypothetical protein 


ORF01900 


SAG 1704 


150 


cytidine/deoxycytidylate deaminase family protein 


ORF01902 


SAG 1705 




peptidase, M24 family POINT MUTATION | 


ORF01903 


SAG 1706 


238 


conserved hypothetical protein 


ORF01904 


SAG 1707 


499 


drug resistance transporter, EmrB/QacA family 


ORF01905 


SAG1 708 


38 


1 ^_ 11 f ■ ■ mm 

hypothetical protein 


ORF01906 


SAG 1709 


942 


excinuclease ABC, A subunit 


^"*r^r™o<< aot 

ORF01907 I 


SAG1710 


223 


conserved hypothetical protein 


/•\ r"i r - * o a ooo 

ORF01908 


SAG 1711 


314 


magnesium transporter, CorA family 


/— \ r-~%i~r\ a ooo. 

ORF01909 


SAG1712 


TO 

79 


ribosomal protein S18 


Anr*n a a a a 

ORF01910 


SAG1713 


A OO 

163 


single-strand binding protein 


r»OCAH AH A 

URF0191 1 


SAG 1714 


95 


ribosomal protein S6 


(JRru1912 ! 


SAG1715 


374 


A/G-specific adenine glycosylase 


ORFQ1913 


SAG 1716 


A OT 

197 


transcriptional regulator, Cro/CI family 


UKrU19 14 


oAbl f 1 ( 


104 


tnioredoxin 


nncn-t a -4 c 
vJKrOlylo 


oAbl r 1o 


^ oo 
1 DO 


PAP2 tamily protein 


ORF01916 


SAG1719 


77ft 

779 


MutS2 family protein 


/"\r5r7A«* a«* t 

ORF01917 


SAG 1720 


•4 OA 

180 


conserved hypothetical protein 


/"*\ n r~ o /< o a o 

ORF01918 


SAG 1721 


A OO 

103 


conserved hypothetical protein 


ORF01919 


SAG 1722 


OAT 

297 


— . — . . — 1 — . — — 1 till 

nbonuclease Hill 


f\ r~\r-/% A AAA 

ORF01920 


O A A TOO 

SAG 1723 


A AT 

197 


signal peptidase 1 


ORF01921 


SAG 1724 


O AO 

806 


helicase, putative 


ORF01922 


O A W TOT 

SAG1 725 


160 


conserved hypothetical protein 


AOCAH aoo 

ORF01923 


SAG1 726 


OO A 

364 


DNA-damage inducible protein P 


/™\l3CrA<4 AO /I 

ORF01924 


O A A TO "7 I 

SAG1 727 


TTA 

770 


formate acetyltransferase 


UKrU192t> 


TOO 

oACal 72o 


A r\ A 

124 


rMN-Dinding protein 


/—\ r-j r~o a aoo. 

URrUi92b 


O A H TOO 

o AG 1729 


O AA 

309 


conserved hypothetical protein 


ADCA4 007 


O A <^~* 4 TO O 


OC A 

251 


proteinase, putative, degenerate, FRAMESHIFT 


ORru192o 


O A /^H TO H 

OAG1731 


O AO 

298 


membrane protein, putative 


ORF01929 


O A >l TOO 

SAG 1732 


o o o 

282 


glycerol uptake facilitator protein, putative 


ORF01930 


SAG 1733 


150 


universal stress protein family 


ORF01931 


SAG1734 


400 


transporter, putative 


ORF01932 


SAG 1735 


219 


transcriptional regulator, Crp/Fnr family 


ORF01933 


SAG1736 


761 


X-pro dipeptidyl-peptidase 


ORF01934 


SAG 1737 


119 


hypothetical protein 


ORF01936 


SAG 1738 


326 


polyprenyl synthetase family protein 


ORF01937 


SAG 1739 


582 


ABC transporter, ATP-binding protein CydC 


ORF01938 


SAG1 740 


572 


ABC transporter, ATP-binding protein CydD 


ORF01939 


SAG1741 


339 


cytochrome d ubiquinol oxidase, subunit II 


ORF01940 


SAG1742 


475 


cytochrome d oxidase, subunit 1 


ORF01941 


SAG1743 


402 


pyridine nucleotide-disulphide oxidoreductase family 
protein 
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ORF01942 


SAG 1744 


299 


prenyltransferase, UbiA family 


ORF01943 


SAG 1745 


148 


hypothetical protein ' 


ORF01944 


SAG 1746 


35 


hypothetical protein 


ORF01945 


SAG1747 


99 


conserved hypothetical protein TIGR0010J 


ORF01946 


SAG 1748 


396 


cyclopropane-fatty-acyl-phospholipid synthase 


ORF01947 


SAG1749 


241 


transcriptional regulator, merR family 


ORF01948 


SAG1750 


195 


exonuclease 


ORF01949 


SAG1751 


178 


conserved hypothetical protein 


ORF01950 


SAG1752 


375 


conserved hypothetical protein TIGR00275 


ORF01951 


SAG1753 


260 


conserved hypothetical protein 


ORF01952 


SAG1754 


89 


ribosomal protein S14 


ORF01953 


SAG1755 


38 


hypothetical protein 


ORF01954 


SAG1756 


341 


conserved hypothetical protein 


ORF01957 


SAG 1757 


336 


O-sialoglycoprotein endopeptidase family protein 


ORF01958 


SAG1758 


135 


ribosomal-protein-alanine acetyltransferase, putative 


ORF01960 


SAG1 759 


230 


glycoprotease family protein, putative 


ORF01961 


SAG 1760 


76 . 


conserved hypothetical protein 


ORF01962 


SAG 1761 


559 


metallo-beta-lactamase superfamily protein ! 


ORF01963 


SAG 1762 


169 


conserved hypothetical protein 


ORF01964 


SAG 1763 


A A O 

448 


glutamine synthetase, type 1 


ORF01965 


SAG 1764 


123 


transcriptional regulator GlnR 


ORF01967 


SAG 1765 


179 


conserved hypothetical protein 


ORF01969 


SAG 1766 


398 


phosphoglycerate kinase 


ORF01970 


SAG 1767 


289 


acid phosphatase 


ORF01971 


SAG 1768 


336 


glyceraldehyde 3-phosphate dehydrogenase 


ORF01972 


SAG 1769 


692 


translation elongation factor G 


ORF01973 


SAG1770 


156 


ribosomal protein S7 


ORF01974 


SAG 1771 


137 


ribosomal protein S12 


ORF01975 


SAG1 772 


270 


pur operon repressor 


ORF01976 


SAG 1773 


313 


HD domain protein 


ORF01977 


SAG 1774 


424 


conserved hypothetical protein 


ORF01978 


SAG 1775 


210 


conserved hypothetical protein 


ORF01979 


SAG 1776 


y-\ ji-n /n 

220 


ribulose-phosphate 3-epimerase 


ORF01980 


SAG 1777 


290 


conserved hypothetical protein TIGR00157 


ORF01981 


SAG 1778 


S-\ jr\ f-\ 

283 


rRNA (guanine-NI-)-methyltransferase, putative 


ORF01983 


SAG 1779 


290 


dimethyladenosine transferase 


ORF01984 


SAG 1780 


163 


hypothetical protein 


ORF01985 


SAG 1781 


186 


primase-related protein 


ORF01987 


SAG 1782 


260 


deoxyribonuclease, TatD family 


ORF01988 


SAG 1783 


90 


hypothetical protein 


ORF01989 


SAG 1784 


130 


hypothetical protein 


I ORF01990 


SAG1785 


430 


hypothetical protein 


ORF01991 


SAG1786 


130 


hypothetical protein 


ORF01992 


SAG1787 


420 


dltD protein 


ORF01993 


SAG 1788 


79 


D-alanyl carrier protein 


ORF01994 


SAG 1789 


421 


dltB protein 


ORF01996 


SAG1790 


511 


D-alanine-activating enzyme 


ORF01997 


SAG1791 


395 


sensor histidine kinase ! 


ORF01998 


SAG1792 


224 


DNA-binding response regulator 


ORF01999 


SAG1 793 


44 


ribosomal protein L34 


ORF02000 


SAG1794 


451 


membrane protein, putative 


ORF02001 


SAG1795 


388 


transposase, IS30 family, putative 


ORF02002 


SAG1796 


575 


amino acid ABC transporter, permease protein 


ORF02004 


SAG1797 


407 


amino acid ABC transporter, ATP-binding protein 
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ORF02005 


SAG1798 


39 


hypothetical protein 


ORF02006 


SAG1799 


792 


xylulose-5-phosphate/fructose-6-phosphate 

m I 1 It 1 

phosphoketolase 


ORF02007 


SAG1800 


363 


conserved hypothetical protein 


ORF02008 


SAG1801 


559 


transcriptional antiterminator, BgIG family 


ORF02009 


SAG1802 


253 


conserved hypothetical protein 


ORF02010 


SAG1803 


505 


carbohydrate kinase, FGGY family 


ORF02011 


SAG1804 


329 


hypothetical protein 


ORF02012 


SAG1805 


483 


PTS system component, putative 


ORF02015 


SAG1806 


318 


glyoxylate reductase, NADH-dependent 


ORF02016 


SAG1 807 


339 


hypothetical protein 


ORF02017 


SAG1808 


327 


sugar binding transcriptional regulator, Lacl family 


ORF02018 


SAG 1809 


215 


transaidolase family protein 


ORF02019 


SAG1810 


238 


carbohydrate isomerase, AraD/FucA family 


ORF02020 


f\ a S~S A O A A 

SAG1811 


O O - 7 

287 


hexulose-6-phosphate isomerase, putative 


ORF02021 


SAG1812 


AO A 

221 


hexuIose-6-phosphate synthase, putative | 


ORF02022 


SAG1813 


161 


PTS system, IIA component 


ORF02023 


SAG1814 


92 


PTS system, KB component 


ORF02024 


SAG1815 


479 


transport protein SgaT, putative 


ORF02025 


SAG1816 


205 


hypothetical protein 


ORF02026 


SAG1817 


157 


hypothetical protein 


ORF02027 


SAG1818 


430 


adenylosuccinate synthetase 


ORF02028 ! 


SAG1819 


o ^ r\ 

340 


perfringolysin O regulator protein 


ORF02029 


SAG1 820 


224 


conserved hypothetical protein 


ORF02030 


SAG1821 


750 


glutamate— cysteine hgase-related protein 


ORF02031 


SAG1 822 


272 


conserved hypothetical protein 


ORF02032 


SAG 1823 


418 


conserved hypothetical protein 


ORF02033 


SAG 1824 


291 


chaperonin, 33 kDa 


ORF02034 


SAG 1825 


325 


NifR3/Smm1 family protein 


ORF02035 


SAG 1826 


213 


deoxynucleoside kinase family protein 


ORF02036 


SAG1 827 


163 


phosphinothncin N-acetyltransferase 


ORF02037 


SAG1828 


815 


ATP-dependent Clp protease, ATP-binding subunit 


ORF02038 


SAG 1829 


a tr a 

154 


transcriptional regulator CtsR 


ORF02039 


SAG 1830 


A C O 

153 


conserved hypothetical protein 


ORF02040 


SAG1831 


O A & 

346 


translation elongation Tactor 1 s 


ORF02041 


o A s~y A 000 

SAG1 832 


256 


ri bosom a l protein 


ORF02042 


c> A A oo o 

SAG 183 3 




alKyi nyaroperoxiae reauctase, suDunit o 


ORF02043 


SAG 1834 


rr A r\ 

510 


aikyi nyaroperoxiae reauctase, suuunit r 


ORr0zU44 


O A <^<4 OO C 

SAG 1835 


1o4 


conservea nypotneticai protein 


ORF02045 


SAG1 836 


OJ 


conservea nypotneticai protein 


ORF02046 


SAG 1837 


AdQ 
4DO 


lysin, putative 


ORF02047 


SAG 1838 


A A A 

109 


noun, putative 


ORF02048 


SAG 1839 


136 


conserved hypothetical protein 


ORF02049 


SAG1 840 


112 


■ |f at • 

hypothetical protein 


ORF02050 


O A /"^ H O A A 

SAG 1841 


To 


conservea aomain protein 


ORF02051 


SAG1842 


1224 


PbIB, putative 


ORF02053 


SAG 1843 


240 


conserved hypothetical protein 


ORF02056 


SAG 1844 


911 


conserved hypothetical protein 


ORF02057 


SAG 1845 


42 


hypothetical protein 


ORF02058 


SAG 1846 


158 


hypothetical protein 


ORF02059 


SAG 1847 


227 


conserved hypothetical protein 


ORF02060 


SAG 1848 


114 


conserved hypothetical protein 


ORF02061 


SAG 1849 


115 


hypothetical protein 
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ORF02062 


SAG1850 


101 


hypothetical protein 


ORF02063 


SAG1851 


111 


conserved domain protein 


ORF02064 


SAG 1852 ; 


420 


conserved domain protein 


ORF02066 


SAG1 853 


180 


protease, putative 


ORF02067 


SAG 1854 j 


380 


1 ft ■ I JL* 1 X * 

conserved hypothetical protein 


ORF02068 


SAG1855 


570 


terminase large subunit, putative 


ORF02069 


SAG1856 


161 


hypothetical protein 


ORF02070 


SAG1858 


95 


hypothetical protein 


ORF02071 


SAG1859 


180 


site-specific recombinase, phage integrase family 


ORF02072 


SAG 1860 


154 


conserved hypothetical protein 


ORF02073 


SAG1861 


119 


transcriptional regulator, Cro/CI family 


ORF02075 


SAG1862 


86 


hypothetical protein 


ORF02076 


SAG1863 


138 


single-strand binding protein 


ORF02077 


SAG1864 


68 


hypothetical protein 


ORF02078 


SAG1865 


74 


conserved hypothetical protein 


ORF02079 


SAG1 866 


109 


conserved hypothetical protein 


ORF02080 


SAG1867 


163 


conserved hypothetical protein 


ORF02081 


SAG1868 


134 


hypothetical protein 


ORF02082 


SAG 1869 


437 


type II DNA modification methyltransferase, putative 


ORF02083 


SAG 1870 


273 


UNA replication protein unau, putative 


ORF02084 


SAG1871 


z4o 


conserved nypotneticai protein 


ORF02085 


SAG 1872 


20U 


nypotneticai protein 


ORF02086 


SAG 1873 


443 


repncative una neiicase 


i API PAOAOT 

ORF02087 


oA(j1o/4 


0~7 

87 


nypotneticai protein 


ORF02088 


SAG 1875 


94 


conserveci nypotneticai protein 


ORF02089 


SAG1876 


176 


HNH endonuclease family protein 


ORF02090 


SAG 1877 


236 


antirepressor protein, putative 


ORF02091 


SAG 1878 


102 


conserved domain protein 


ORF02092 


SAG1879 


156 


hypothetical protein 


ORF02093 


SAG 1880 


54 


hypothetical protein 


ORF02094 


SAG1881 


51 


hypothetical protein 


ORF02095 


SAG1882 


120 


repressor protein, putative 


ORF02097 


SAG1884 


134 


hypothetical protein 


ORF02098 


SAG1885 


356 


site-specific recombinase, phage integrase family 


ORF02100 


SAG1 880 




nypotneticai protein 


ORF02101 


O A 4 0 0*7 

SAG 1887 


ooo 

589 


Na+/n+ excnanger Tamiiy protein 


ORF02102 


O A H OOO 

SAG 1888 


7o 


nypotneticai protein 


ORF02103 


oAolooy 


Ol / 


microcm immunity protein iviccr, putative 


ORF02104 


O A A Oft A 




enoopeptiaase \j 


ORF02105 


SAG 1891 


007 


oxiaoreouctase, oio/ian/iviocA Tamiiy 


I ORF02107 


SAG1 892 


oco 

358 


membrane protein, putative 


ORF02108 


SAG1893 


59 


hypothetical protein 


ORF02109 


SAG1894 


214 


Cyclic nucleotide-binding domain protein 


ORF021 10 


oACjISyo 


zU4 


poiypepttue aerormyiase 


ORF021 1 1 


SAG1896 


333 


sugar binding transcriptional regulator RegR 


ORF02112 


SAG1897 


634 


conserved hypothetical protein 


ORF02113 


SAG1898 


271 


PTS system, 1ID component 


ORF02114 


SAG1899 


288 


PTS system, IIC component 


ORF02115 


SAG1900 


164 


PTS system, KB component 


ORF021 1 6 


SAG1901 


398 


glucuronyl hydrolase 


ORF02118 


SAG1902 


144 


PTS system, HA component 


ORF02119 


SAG1 903 


34 


hypothetical protein 
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ORF02120 


SAG 1904 

i ! 


270 


oxido reductase, short-chain dehydrogenase/reductase 
family 


ORF02121 


SAG 1905 


212 


conserved hypothetical protein 


ORF02122 


SAG 1906 


335 


carbohydrate kinase, PfkB family 


ORF02123 


SAG1 907 


212 


2-dehydro-3-deoxyphosphog luconate aldoiase/4- 
hydroxy-2-oxoglutarate aldolase 


ORF02124 


SAG 1908 


499 


hypothetical protein 


ORF02125 


SAG 1909 


204 


nitroreductase family protein 


ORF02126 


SAG1910 


141 


transcriptional regulator, MarR family 


ORF02127 


SAG1911 


1468 


DNA polymerase III, alpha subunit, Gram-positive type 


ORF02128 


SAG1912 


194 


N-acetylmuramoyl-L-alanine amidase, family 4 protein 


ORF02129 


SAG1913 


617 


prolyl-tRNA synthetase 


ORF02130 


SAG1914 


419 


membrane-associated zinc metalloprotease, putative 


ORF02131 


/"X a At A .A ^ 

SAG1915 


264 


phosphatidate cytidylyltransferase 


ORF02132 


SAG1916 


250 


undecaprenyl diphosphate synthase 


ORF02133 


SAG1917 


113 


preprotein translocase, YajC subunit 


ORF02134 


^x a y"N a ^\ a a 

SAG1918 


mm A A 

114 


conserved hypothetical protein 


ORF02135 


SAG1919 


A A •mm 

387 


malate oxidoreductase 


ORF02136 


A A /-v A A 

SAG 1920 


445 


citrate carrier protein, CCS family 


ORF02137 


SAG1921 | 


508 


sensor histidine kinase family protein 


ORF02138 


SAG 1922 


AAA 

229 


response regulator 


ORF02139 


SAG 1923 


331 


UDP-glucose 4-epimerase 


ORF02140 


SAG 1924 


A f— 

535 


glucan 1 ,6-alpha-glucosidase 


ORF02141 


SAG 1925 


A ■ ■ 1 P 

377 


sugar ABC transporter, ATP-binding protein 


ORF02142 


/*—». A /*\ J A ,*X A 

SAG 1926 


/*A A 

283 


helix-turn-helix domain protein, fis-type 


ORF02143 


SAG 1927 


298 


lacX protein 


ORF02144 


SAG 1928 


A A f-» 

325 


tagatose 1 ,6-diphosphate aldolase 


ORF02145 


SAG 1929 


310 


tagatose-6-phosphate kinase 


ORF02146 


SAG 1930 


171 


galactose-6-phosphate isomerase, LacB subunit 


ORF02147 


SAG1931 


141 


galactose-6-phosphate isomerase, LacA subunit 


ORF02148 


SAG1932 


816 


neuraminidase 


ORF02149 


SAG 1933 


482 


PTS system, I JO component, putative 


ORF02150 


SAG1934 


101 


PTS system, IIB component, putative 


ORF02152 


SAG1935 


157 


PTS system, IIA component, putative 


ORF02153 


SAG1936 


258 


lactose phosphotransferase system repressor 


ORF02156 


SAG1937 


i 


streptococcal histidine triad family protein, degenerate, 
FRAMESHIFT 


ORF02157 


SAG1 938 


A A *-* 

307 


adhesion lipoprotein, putative 


ORF02158 


A A A A A 

SAG1 939 


A A ~T 

147 


conserved hypothetical protein TIGR00256 


ORF02159 


SAG1 940 


A A 

738 


GTP pyrophosphokinase 


ORF02160 


SAG1941 


800 


2 ,3 -cyclic-nucleotide 2 -phosphodiesterase 


ORF02161 


SAG 1942 


151 


|1 Ml | ft » 

nrdl protein, putative 


ORF02162 


SAG1943 


345 


conserved hypothetical protein 


ORF02163 


SAG1 944 


165 


conserved hypothetical protein 


ORF02164 


#"X A ^% A A A 9^ 

SAG 1945 


A if" 

345 


iron ABC transporter, iron-binding protein 


ORF02165 


SAG1946 


257 


DNA-binding response regulator 


ORF02166 


SAG1947 


549 


conserved hypothetical protein 


ORF02167 


SAG1948 


275 


PTS system, HD component 


ORF02168 


SAG1949 


269 


PTS system, HC component 


ORF021 69 


SAG1 950 


163 


PTS system, IIB component 


ORF02170 


SAG1951 


141 


PTS system, IIA component, putative 


ORF02171 


SAG1952 


353 


membrane protein, putative 


ORF02172 


SAG1953 


60 


hypothetical protein 
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ORF02173 


SAG 1954 


384 


hypothetical protein 


ORF02174 


SAG 1955 


282 


ABC transporter, ATP-binding protein 


ORF02175 


SAG 1956 


96 


conserved domain protein 


ORF02176 


SAG 1957 


250 


response regulator 


ORF02177 


SAG 1958 


276 


conserved hypothetical protein 


ORF02178 


SAG 1959 


727 


PTS system, IIABC components 


ORF02179 


SAG 1960 


551 


sensor histidine kinase 


ORF02180 


SAG1961 


225 


phosphate reguion response regulator PhoB 


ORF02181 


SAG 1962 


218 


phosphate transport system regulatory protein PhoU, 
putative 


ORF02182 


SAG1963 


253 


ft m t m. i _ . a jk ' l g v I * ■* t * 

phosphate ABC transporter, ATP-binding protein 


ORF02183 


SAG 1964 


292 


phosphate ABC transporter, permease protein 


ORF02184 


, SAG1 965 


281 


phosphate ABC transporter, permease protein 


ORF02186 


SAG1966 


293 


hemolysin precursor, putative 


ORF02187 


SAG1967 


195 


hypothetical protein 


ORF021 88 


SAG1968 


246 


conserved hypothetical protein TIGR00046 


ORF02189 


SAG 1969 


317 


nbosomal protein L1 1 methyltransferase 


ORF02190 


SAG 1970 


102 


conserved hypothetical protein 


ORF02191 


SAG1971 


41 


hypothetical protein 


ORF02192 


SAG 1972 


238 


transcriptional regulator, MerR family 


ORF02194 


SAG1973 


156 


acetyltransferase, GNAT family | 


ORF021 95 


SAG 1974 


152 


MutT/nudix family protein 


ORF02196 


SAG 1975 


47 


ft i ■ r " ftJ * * 

hypothetical protein 


ORF02197 


SAG 1976 


156 


conserved hypothetical protein 


ORF02198 


SAG1977 


163 


■ ftJ ft «* a, 1 j| ■■ ■ ■ f~ ■ 1 

acetyltransferase, GNAT family 


ORF02199 


SAG1978 


422 


ATPase, AAA family 


ORF02201 


SAG1979 


253 


hypothetical protein 


ORF02202 


SAG1980 


300 


ABC transporter, ATP-binding protein 


ORF02203 


SAG 1981 


68 


hypothetical protein 


ORF02205 


SAG1982 


359 


transcriptional regulator, Cro/CI family 


ORF02206 


SAG1983 


105 


II Ml ft* 1 ft* 

conserved hypothetical protein 


ORF02207 


SAG1984 


188 


conserved hypothetical protein TIGR00730 


ORF02208 


SAG1985 


51 


hypothetical protein 


ORF02209 


SAG1986 


375 


integrase, phage family, putative 


ORF02210 


SAG1987 


61 


conserved hypothetical protein 


ORF0221 1 


SAG 1988 


342 


conserved hypothetical protein 


ORF02212 


SAG 1989 


139 


hypothetical protein 


ORF02213 


SAG1990 


127 


hypothetical protein 


ORF02214 


SAG1991 


204 


transcriptional regulator, Cro/CI family 


ORF02215 


SAG 1992 


518 


conserved hypothetical protein 


ORF02216 


SAG1993 


373 


site-specific recombinase, phage integrase family 


ORF02217 


SAG 1994 


108 


conserved hypothetical protein 


ORF02219 


SAG1995 


210 


hypothetical protein 


ORF02221 


SAG1996 


263 


cell wall anchor protein-related protein 


ORF02223 


SAG1997 


182 


hypothetical protein 


ORF02224 


SAG 1998 


457 


r . _ ti_ _if_ _■ 

hypothetical protein 


ORF02225 


SAG1999 


47 


hypothetical protein 


ORF02226 


SAG2000 


666 


membrane protein, putative 


ORF02227 


SAG2001 


756 


conjugal transfer protein, interruption-C 


ORF02228 


SAG2002 


129 


IS1381, transposase OrfB 


ORF02229 


SAG2003 


127 


IS1 381 , transposase OrfA j 


ORF02230 


SAG2005 


136 


conserved hypothetical protein 


ORF02231 


SAG2006 


88 


conserved hypothetical protein 


ORF02232 


SAG2007 


317 


conserved hypothetical protein 
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ORF02233 


bAG2UUo 


84 


conserved nypotneticai protein 


ORF02234 


oAG2009 


88 


conserved nypotneticai protein 


QRF02235 


O AOOAH A 

oACd2U1 U 


1 57 


nypotneticai protein 


(JKr022ob 


O A AOA-1 A 


IbO 


conserved nypotneticai protein 


An r - a a oi7 

ORF02237 


o a oon-i o 


A A 

90 


nypothetical protein 


ORF02238 


OA o 

oAG201o 


189 


hypothetical protein 


ORF02239 


OA/^AA^ 

SAG2014 


A A C\ 

449 


hypothetical protein 


ORF02240 ! 


SAG2015 


99 


transcriptional regulator, Oro/Cl family 


ORF02241 


SAG201 6 


125 


hypothetical protein 


ORF02242 


SAG2017 


429 


transcriptional regulator, Cro/Cl family 


ORF02243 


SAG2018 


553 


FtsK/SpolllE family protein 


ORF02244 


SAG2019 


153 


hypothetical protein 


ORF02245 


SAG2020 


98 


hypothetical protein 


ORF02246 


SAG2021 


826 


cell wall surface anchor family protein 


ORF02247 


SAG2022 


417 


transposase, ISL3 family 


ORF02249 


SAG2023 


546 


mercuric reductase 


ORF02250 


SAG2024 


130 


mercuric resistance operon regulatory protein MerR 


ORF02251 


oa r*or>oc 


C A A 

522 


Mn2+/Fe2+ transporter, iNKAMr' Tamiiy 


ORF02252 


SAG2026 


A A A 

240 


membrane protein, putative 


ORr0225o 


OA P0A07 


OAC 

205 


Abo transporter, A 1 K-Dinaing protein 


ORF02254 


O A AOftOO 


O £> 
OO 


conserved nypotneticai protein 


ORF02255 


O A POAOfl 


2o4 


streptomycin resistance protein 


ORF02257 


o a Aonon 
oAG20oO 


A OA 

130 


nypotneticai protein 


ADcmoco 
OKr0225o 


O A OOAO -1 


OA A 


nypotneticai protein 


ORF02259 


o a Aono o 


AAA 
111 


conserved nypotneticai protein 


ORF02260 


O A A A D O 

SAG2033 


H CO 

1o2 


acetyltransterase, oNA l tamiiy 


ADCflOOC H 

<JKr022o1 


O A /"* O AO A 


24 f 


memorane protein, putative 


ORF02262 


OAG2035 


O A A 

300 


Abo transporter, A 1 r'-Dinding protein 


ORF02263 


O A /"»A AO £> 

SAG2036 


bo 


nypotneticai protein 


ORF022b4 


o a r>oni -7 
OAG2037 


O CO 

o5o 


transcriptional regulator, uro/oi Tamiiy 


ORF02265 


O A A AO O 

SAG2038 


OA /I 

204 


hahz tamiiy protein 


ORF02266 


SAG2039 


AO 

98 


conserved hypothetical protein 


ORF02267 


SAG2040 


186 


conserved hypothetical protein T1GR00730 


ORF02268 


SAG2041 


287 


protease, putative 


ORF02269 


SAG2042 


100 


rhodanese family protein 


ORF02270 


SAG2043 


255 


cAMP factor 


ORF02271 


SAG2044 


62 


hypothetical protein 


ORF02272 


SAG2045 


179 


DNA topology modulation protein FlaR, putative 


ORF02273 


SAG2046 


361 


glycerol dehydrogenase, putative 


ORF02274 


SAG2047 


235 


conserved hypothetical protein 


ORF02275 


SAG2048 


614 


5-methyltetrahydrofolate-homocysteine 








methyltransf erase, putative 


ORF02276 


SAG2049 


745 


5-methyitetrahydropteroyltriglutamate-homocysteine 








methyltransferase 


ORF02277 


SAG2050 


107 


conserved hypothetical protein 


ORF02278 


SAG2051 


230 


branched-cham amino acid transport protein AzIC, 








putative 


ORF02279 


SAG2052 


41 


hypothetical protein 


ORF02280 


SAG2053 


1570 


serine protease, subtilase family, putative 


ORF02281 


SAG2054 


228 


DNA-binding response regulator 


ORF02282 


SAG2055 


462 


sensor histidine kinase 


ORF02283 


SAG2056 


202 


chromosome assembly-related protein 


ORF02285 


SAG2057 


833 


leucyl-tRNA synthetase 


ORF02286 


SAG2058 


415 


major facilitator family protein 
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ORF02287 


SAG2059 


281 


conserved hypothetical protein 


ORF02288 


SAG2060 


398 


glycosyl transferase, family 8 


ORF02289 


SAG2061 


401 


glycosyl transferase, family 8 


ORF02290 


SAG2062 


179 


transcription antitermination protein NusG 


ORF02291 


SAG2063 


630 


pathogenicity protein, putative 


ORF02292 


SAG2064 


57 


preprotein translocase, SecE subunit, putative 


ORF02293 


SAG2066 


773 


penicillin-binding protein 2A 


ORF02294 


SAG2067 


294 


ribosomal large subunit pseudouridine synthase, RluD 
subfamily 


ORF02295 


SAG2068 


546 


Lyme disease proteins of unknown function, putative 


ORF02296 


SAG2069 


403 


phosphopentomutase 


ORF02297 


SAG2070 


223 


i *t t • j* lit 

deoxynbose-phosphate aldolase 


ORF02298 


SAG2071 


400 


Na+ dependent nucleoside transporter 


ORF02300 


SAG2072 


259 


undine phosphorylase 


ORF02301 


SAG2073 


245 


transcriptional regulator, GntR family j 


ORF02302 


SAG2074 


540 


60 kda chaperonin j 


ORF02303 


SAG2075 


94 


chaperonin, 10 kDa 


ORF02305 


SAG2076 


267 


ABC transporter, ATP-binding protein 


ORF02306 


SAG2077 


298 


ABC transporter, permease protein 


ORF02307 


SAG2078 


320 


lipoprotein, putative 


ORF02308 


SAG2079 


265 


hydrolase, haloacid dehalogenase-like family 


ORF02309 


SAG2080 


286 


glyoxalase family protein 


ORF02310 


SAG2081 


243 


conserved hypothetical protein 


ORF0231 1 

■ 


SAG2082 


205 


anaerobic ribonucleoside-triphosphate reductase 
activating protein 


ORF02312 


SAG2083 


163 


acetyltransferase, GNAT family 


ORF02313 


SAG2084 


310 


virulence factor MviM, putative 


ORF02314 


SAG2085 


47 


conserved hypothetical protein 


ORF02315 


SAG2086 


723 


anaerobic ribonucleoside-triphosphate reductase 


ORF02316 


SAG2087 


495 


conserved hypothetical protein 


ORF02317 


SAG2088 


40 


hypothetical protein 


ORF0231 8 


SAG2089 


105 


conserved hypothetical protein 


ORF02319 


SAG2090 


136 


conserved hypothetical protein TIGR00250 


ORF02320 


SAG2091 


88 


conserved hypothetical protein 


ORF02321 


SAG2092 


132 


conserved hypothetical protein 


ORF02322 


SAG2093 


379 


recA protein 


ORF02323 


SAG2094 




competence/damage-inducible protein CinA 
FRAMESHIFT 


ORF02325 


***m A AW^. Awm 

SAG2095 


183 


i k ■ a (ill* l l f 

DNA-3-methyladenine glycosylase 1 


ORF02327 


SAG2096 


196 


II ll » _1 « t • . ft""*. |L 1 A I— _ t * l*"% . mm m A 

Holliday junction DNA hehcase RuvA 


^Ifc hh S*k ^fc _n> 

ORF02328 


SAG2097 


A mi f\ 

418 


transporter, putative 


ORF02329 


SAG2098 


P* ***V 

659 


DNA mismatch repair protein HexB 


ORF02330 


SAG2099 


33 


hypothetical protein 


ORF02331 


SAG2100 


67 


cold shock protein, CSD family 


ORF02332 


SAG2101 


858 


DNA mismatch repair protein HexA 


ORF02333 


SAG21 02 


145 


arginine repressor ArgR, putative 


ORF02334 


SAG2103 


563 


arginyl-tRNA synthetase 


ORF02335 


SAG2104 


102 


conserved hypothetical protein 


ORF02337 


SAG2105 


290 


conserved hypothetical protein 


ORF02338 


SAG2106 


314 


conserved hypothetical protein 


ORF02339 


SAG2107 


583 


aspartyl-tRNA synthetase 


ORF02340 


SAG2108 


426 


histidyl-tRNA synthetase 


ORF02341 


SAG2109 


60 


ribosomal protein L32 
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Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 



ORF Ref No. 


SAGxxxx Ref No. 


aa 


Annotation 


ORF02342 


SAG2110 


49 


ribosomal protein L33 


ORF02343 


SAG21 1 1 


173 


conserved hypothetical protein 


ORF02344 


SAG21 1 2 


494 


site-specific recombinase, phage integrase family 


ORF02345 


SAG2113 


82 


conserved hypothetical protein 


ORF02346 


SAG2114 


342 


conserved hypothetical protein 


ORF02347 


SAG2115 


143 


hypothetical protein 


ORF02349 


SAG21 1 6 


151 


conserved hypothetical protein 


ORF02350 


SAG2117 


71 


hypothetical protein 


ORF02351 


SAG2118 


306 


transcriptional regulator, Cro/CI family 


ORF02352 


SAG2119 


373 


conserved domain protein 


ORF02355 


SAG2120 


56 


hypothetical protein 


ORF02356 


SAG2121 


176 


; hypothetical protein 


ORF02357 


SAG2122 


223 


DNA-binding response regulator 


ORF02358 


SAG2123 


454 


sensor histidine kinase 


ORF02359 


SAG2124 


517 


membrane protein, putative 


ORF02360 


SAG2125 


308 


carbamate kinase 


ORF02361 


SAG2126 


332 


ornithine carbamoy {transferase 


ORF02362 


SAG2127 


431 


sensor histidine kinase 


ORF02363 


SAG2128 


277 


response regulator 


ORF02364 


SAG2129 


240 


amino acid ABC transporter, ATP-binding protein 


ORF02365 


SAG2130 


504 


amino acid ABC transporter, permease and amino acid 








binding protein 


ORF02367 


SAG2131 


847 


membrane protein, putative 


ORF02368 


SAG2132 


247 


conserved hypothetical protein 


ORF02369 


SAG2133 


118 


conserved hypothetical protein 


ORF02370 


SAG2134 


772 


membrane protein, putative 


ORF02371 


SAG2135 


179 


transcriptional regulator, TetR family, putative 


ORF02372 


SAG2136 


98 


conserved hypothetical protein 


ORF02373 


SAG2137 


203 


ribosomal protein S4 


ORF02374 


SAG2138 


95 


conserved hypothetical protein 


ORF02375 


SAG21 39 


451 


replicative DNA helicase 


ORF02376 


SAG2140 


150 


ribosomal protein L9 


ORF02377 


SAG2141 


660 


DHH family protein 


ORF02378 


SAG2142 


613 


glucose inhibited division protein A 


ORF02379 


SAG2143 


203 


conserved hypothetical protein TIGR00427 


ORF02380 


SAG2144 


373 


tRNA (5-methylaminomethyl-2-thiouridylate)- 








methyltransferase 


ORF02381 


SAG2145 


222 


L-serine dehydratase, iron-suifur-dependent, beta 








subunit 


ORF02382 


SAG2146 


290 


L-serine dehydratase, iron-sulfur-dependent, alpha 








subunit 


ORF02383 


SAG2147 


234 


conserved hypothetical protein 


ORF02384 


SAG2148 


179 


LysM domain protein 


ORF02385 


SAG2149 


264 


cobalt transport family protein 


ORF02386 


SAG2150 


280 


ABC transporter, ATP-binding protein | 


ORF02387 


SAG2151 


279 


ABC transporter, ATP-binding protein, FRAMESHIFT 


ORF02388 


SAG2152 


180 


CDP-diacylglyceroI-glycerol-3-phosphate 3- ; 








phosphatidyltransferase 


ORF02389 


SAG2153 


427 


peptidase, M1 6 family 


ORF02390 


SAG2154 


414 


conserved hypothetical protein 


ORF02391 


SAG2155 


117 


conserved hypothetical protein 


ORF02392 


SAG2156 


369 


recF protein 


ORF02393 


SAG2157 


278 


transporter, putative 


ORF02395 


SAG2158 


220 


transcriptional regulator, Cro/C! family 
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Table 32: Conversion of ORF Ref Nos. with SAG Ref Nos. 

) 



UKr K©T NO. 


SAGxxxx Ret NO. 


aa 


Annotation 




OAPOi CO. 


4yo 


jnosine-o -monopnospnaxe aenyarogenase 


vjrxru^oy ( 


OAOO^ CA 


•i CH 


transcriptional regulator, ArgK ramiiy 




Onb^ I D 1 


00c 


ik-»» ^^p^nli/Hnol rant tlo4#\r Orr\/IT«i* »-k-» ■ 1 \ y 

transcnptionai reguiaior, orp/rnr Tamiiy 


ORP091QQ 

w r\ i UiC o y y 


CAnO"! CO 


iio4 


consGrvea nypoineiicai protein 




QAA01 CO 


A*l A 
4 1 U 


arginme aeiminase 




OAo^ I04 


^ QC 


«xaa4^ fUr Ank>TAr^oo lOKI AT m i It # 

acetyitransierase, oinai Tamuy 




QAAO-1 CC 

oAv^Z 1 DO 


o»5/ 


ornitnme carDamoyuransierase 




OMot; I Do 




arginme/orniinine antiporter 


npcnoinA 
wrerU<ti4U**- 


OMbZ 1 0 / 




■■t~*k. kM J^^J^\ 1/ 1 W\ 

carDamate Kinase 


/->D CAO/1 AC 


OA^O-ICQ 

oAva^loo 


o41 


tryptopnanyi-tKNA syntnetase / 


/~\r~} eTAOvl AC 




OO A 


conserved nypotnetical protein 


ORF02407 


SAG21 70 


290 


conserved hypothetical protein 


ORF02408 


SAG2171 


539 


Am f— ^k, - . ■ ji pmam\ a « . a 

ABC transporter, ATP-binding protein 


ORF02409 


SAG2172 


859 


ABC transporter, permease protein, putative 


^^^■4 IP"*^k jkkk jfl A ^kkt 

ORF02410 


SAG2173 


159 


conserved hypothetical protein T1GR00246 


ORF0241 1 


SAG2174 


409 


serine protease 


ORF02412 


SAG2175 


257 


a v r * » ■ % ■ k. ■ ik j>> p ■ 

partitioning protein, ParB family 


ORF0241 3 


SAG0001 


453 


chromosomal replication initiator protein DnaA 


ORF02415 


SAG0002 


378 


fmm~ a. j — g 1 i 1 1 ■ ■ • A 

DNA polymerase 111, beta subunit j 


ORF02416 


SAG0003 


293 


k k If- | . « # 1 »7 * 1 k ■ * a a k 

diacylglycerol kinase catalytic domain protein, putative 


\jHr\jdA\ ( 


O A /^AAAyf 
OAC3UUU4 


CC 
DO 


conserved hypothetical protein 


ORF0241o 


SAG0005 


67 


hypothetical protein 


ORF02419 


SAG0006 


371 


conserved hypothetical GTP-binding protein 


ORF02420 


SAG0007 


191 


peptidyl-tRNA hydrolase 


i ORF02421 


SAG0008 


1165 


transcription-repair coupling factor 


ORF02422 


SAG0009 


31 


hypothetical protein 


ORF02423 ! 


SAGO0 10 


90 


S4 domain protein 


^^V. ^"k* |^"> ^*4. J"H. A AfK At 

ORF02424 


SAG001 1 


J ^kv r*. 

123 


cell division protein DivIC, putative 


ORF02425 


SAGO0 12 


44 


conserved hypothetical protein 


ORF02426 | 


SAGO0 13 


428 


conserved hypothetical protein 


ORF02427 


SAG0014 


ji ^ jt 

424 i 


MesJ/Ycf62 family protein 


ORF02428 


SAGO0 15 


180 


hypoxanthine-guanine phosphoribosyltransferase 


An CAO/1 OA 


oAbUUib 


DOO 


ceil division protein rtsn 


ADcnonnn 


O A /*^A»f K7 

oAbUlOr 




unase-reiated protein, Ubt^bNtKAi b 


\JKrUoUU i 


O A /^ACi7Q 

oAbUD/y 




conserved nypotnetical protein 


VJKrUoUU^: 


oAbUOdU 


H H i 

m 


conserved nypotnetical protein, truncation 


VJKrUoUUo 


oAbUOOZ 




i nozoz, vjrr i_o protein, degenerate 


i ADcninn/i 
UKruoUU4 


C A^ARR*^ 


Of 


conserved nypotnetical protein 


UKrUOUUO 


QA^SnCftO 
oAbUDD^ 


IU I 


cyiA protein 


vJKrUOUUD 


CA^OQ-I T 
oAbUo 1 / 


B1 


inyio, nypotnetical protein 


UKrUOUUO 


oAbuy^U 


01 


inyio, nypotnetical protein 


UKrUoUUo 


oAbUy^^ 


01 


inyib, nypotnetical protein 


#■ 'Ifc P* A*\ AT\ AT* J"* A+*. 

ORF03009 


-1— x A A A A At 

SAG0924 


A*ir> • 

28 


_At A^± ■ k ■ *k « Ik J » 1 

Tn916, tetM leader peptide 


ORF03010 


SAG0936 


39 


Tn916, hypothetical protein 


ORF0301 1 


O A 4 AO A 

oAo14o4 


A O 

4o 


ribosomal protein L33 


ORF03012 


SAG1857 


119 


HNH endonuclease family protein 


ORF03013 


SAG1883 


128 


conserved hypothetical protein 


ORF03014 


SAG2065 


50 


ribosomal protein L33 


ORF0301 5 


SAG2004 


67 


conjugal transfer protein, interruption-N 
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Table 33: List of GAS ORFs which are shared with GBS and Spn 



gi 13621326 gb 
gi 13621327 gb 
gi 13621328 gb 
gi 13621329 gb 
gi 13621330 gb 
gi 13621 331 jgb 
gi 13621332|gb 
gi 13621333 gb 
gi 13621334 gb 
gi 13621335 gb 
gi 13621 337 gb 
gi|13621340 gb 
gi|13621341 gb 
gi|1 3621 343 gb 



AAK33146.1 
AAK33147.1 
AAK33148.1 
AAK33149.1 
AAK33 150.1 
AAK33151.1 
AAK33152.1 
AAK33153.1 
AAK33 154.1 
AAK33155.1 
AAK33156.1 
AAK33158.1 
AAK33159.1 
AAK33160.1 



gi 1 1 362 1 344 |gb | AAK33 161.1 

gb|AAK33163.1 
gb|AAK33164.1 
gb|AAK33165.1 



gi 
gi 



13621346 
13621347 
13621348 



gi|1 3621349 



g« 
gi 
gi 



g* 
gi 



gb 
gb 
gb 
gb 



13621350 
13621353 
13621354 
gi|1 3621 355 
gi|1 3621 357 
13621358 
13621359 
gi|13621361|gb 
gi|13621362|gb 
gi|1 3621 363 
gi|13621364 
gi|1 3621 365 
13621366 
13621367 
13621368 
13621369 
gi|13621370 
gi|13621372 

gi 



AAK33166.1 
AAK33167.1 
AAK33169.1 
AAK33170.1 



gb|AAK33171.1 



gi 
gi 
gi 
gi 



AAK33173.1 
AAK33174.1 
AAK33175.1 
AAK33 176.1 
AAK33 177.1 
AAK33 178.1 
gb|AAK33179.1 
gb|AAK33180.1 



gb 
gb 
gb 



gb 



gb 
gb 
gb 
gb 
gb 
gb 



AAK33181.1 
AAK33 182.1 
AAK33183.1 
AAK33184.1 
AAK33 185.1 
AAK33 186.1 



13621 373|gb|AAK33187.1 
1 3621 374|gb|AAK331 88.1 



gi|13621375|gb 



gb 
gb 



gi 13621376 
gi 13621377 
gi 13621378|gb 
gi 13621379|gb 
gi|1 3621 380 
gi|1 3621 382 

gi 



AAK33189.1 
AAK33190.1 
AAK33191.1 
AAK33192.1 
AAK33 193.1 
gb|AAK33194.1 



gi 
gi 
gi 
gi 
gi 
gi 
gi 
gi 



gb|AAK33196.1 
1 3621 383|gb|AAK331 97.1 
1 3621 384|gb|AAK331 98.1 



13621385 
13621386 
13621387 
13621388 
13621389 
13621390 
13621391 



gi|1 3621 392 



gb 
gb 
gb 

gb 
gb 

gb 
gb 
gb 



AAK33 199.1 
AAK33200.1 
AAK33201.1 
AAK33202.1 
AAK33203.1 
AAK33204.1 
AAK33205.1 
AAK33206.1 



gi 


13621393 


gb 


AAK33207.1 


gi 


13621394 


gb 


AAK33208.1 


gi 


13621397 


gb 


AAK33210.1 


gi 


13621398 


gb 


AAK33211.1 


gi 


1 362 1 399 |gb | AAK332 12.1 


gi| 


13621401 


gb 


AAK33214.1 


gi 


13621403 


gb 


AAK33215.1 


g> 


13621404 


gb 


AAK33216.1 


gi 


13621405 


gb 


AAK33217.1 


gi 


13621407 


gb 


AAK33218.1 


gi 


13621408|gb 


AAK33219.1 


gi 


13621409 


gb 


AAK33220.1 


qi 


13621413 


gb 


AAK33224.1 


Qi 


13621415 


gb|AAK33226.1 


gi 


13621416 


gb|AAK33227.1 


gi 


13621418 


gb|AAK33229.1 


gi 


13621419 


gb|AAK33230.1 


gi 


13621424 


gb 


AAK33234.1 


gi 


13621425 


gb 


AAK33235.1 


gi 


13621426 


gb|AAK33236.1 


gi 


13621434 


gb 


AAK33243.1 


gi 


13621450 


gb 


AAK33258.1 


gi 


13621455 


gb|AAK33262.1 


gi 


13621456 


gb|AAK33263.1| 


gi 


13621457 


gb 


AAK33264.1| 


gi 


13621467 


gb 


AAK33273.1 


gi 


13621468 


gb 


AAK33274.1 


gi 


13621469 


gb 


AAK33275.1 


gi 


13621470 


gb|AAK33276.1 


gi 


13621471 


gb|AAK33277.1 


gi 


1 13621 472 |gb 


AAK33278.1| 


gi 


13621473|gb 


MK33279.1| 


gi 


13621476|gb 


AAK33281.1| 


gi 


13621477 


gb 


AAK33282.1 


gi 


13621478 


gb 


AAK33283.1 


gi 


13621480 


gb 


AAK33285.1 


gi 


113621481 Igb 


AAK33286.1 


gi 


13621491 


gb|AAK33295.1| 


gi 


13621494 


gb)AAK33298.1| 


gi 


13621496 


gb 


AAK33299.1) 


gi 


13621501 


gb 


AAK33304.1| 


gi 


13621502 


gb 


AAK33305.1 


gi 


13621505 


gb 


AAK33307.1 


gi 


13621506 


gb 


AAK33308.1 


gi 


13621507|gb 


AAK33309.1 


gi 


13621 51 0|gb 


AAK33312.1 


gi 


13621511 


gb|AAK33313.1 


gi 


113621513 


gb|AAK33315.1| 


g« 


13621516 


gb|AAK33317.1 


gi 


13621518 


gb|AAK33319.1 


gi 


13621521 


gb 


AAK33322.1 


gi 


13621522 


gb 


AAK33323.1 


gi 


13621523 


gb 


AAK33324.1 


gi 


|13621524 


gb 


AAK33325.1 


gi 


13621525 


gb 


AAK33326.1 


g» 


13621527 


gb 


AAK33327.1 
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Table 33: List of GAS ORFs which are shared with GBS and Spn 



gi 1 


1 3621 528|gb|AAK33328. 1 1 


gi|1 3621 595 


gb 


AAK33389. 


ail 
y i 


13621529 


gb|AAK33329.1| 


gi|1 3621 596 


gb 


AAK33390. 


ai 


13621530 


gb|AAK33330.1 


gi 


13621597 


gb 


AAK33391. 


ai 

y • 


13621531 


gb|AAK33331.1| 


gi 


13621598 


gb 


AAK33392. 


ai 
y* 


13621532 


gb 


AAK33332.1| 


gi 


13621599|gb 


AAK33393. 


ai 
y* 


13621533 


gb 


AAK33333.1 


gi 


13621600|gb 


AAK33394. 


ai 
y 1 


13621534 


gb 


AAK33334.1 


gi 


13621602 


gb 


AAK33395. 


ai 
y 1 


13621535 


gb 


AAK33335.1| 


gi 


13621603 


gb 


AAK33396. 


ai 
y* 


13621536 


gb 


AAK33336.1 


g« 


13621604 


gb 


AAK33397. 


ail 
y*l 


13621537 


gb 


AAK33337.1 


gi 


13621605 


gb 


AAK33398. 


ail 
y'l 


1 3621 539|gb|AAK33338.1 1 


gi 


13621606 


gb 


AAK33399. 


ail 
y*i 


1 3621 540|gb|AAK33339\1 


gj 


13621607 


gb 


AAK33400. 


ail 

y i 


13621541 |gb|AAK33340.1 


gi 


13621608|gb 


AAK33401. 


ail 

y i 


13621542|gb|AAK33341.1| gi 


13621609 


gb 


AAK33402. 


ai 

y 1 


13621543 


gb 


AAK33342.1 


gi 


13621611 


gb 


AAK33404. 


ai 

y 


13621544 


gb 


AAK33343.1 


gi 


13621614 


gb 


AAK33406. 


ai 

y 


13621546 


gb 


AAK33345.1 


gi 


13621615 


gb|AAK33407. 


ail 

y*i 


13621547 


gb 


AAK33346.1 


gi 


13621616 


gb 


AAK33408. 


ai 
y* 


13621548 


gb 


AAK33347.1 


gi 


13621617 


gb 


AAK33409. 


ai 
y* 


13621550|gb 


AAK33348.1 


gi 


13621618 


gb 


AAK33410. 


Qi 

y • 


13621551 


gb 


AAK33349.1 


gi 


13621 61 9|gb 


AAK3341 1 . 


Qi 

y ■ 


13621552 


gb 


AAK33350.1 


gi 


13621620|gb 


AAK33412. 


Qi 

y ' 


13621553 


gb 


AAK33351.1I gi 


13621 621 |gb 


AAK33413. 


ai 

y 1 


13621554 


gb 


AAK33352.1 


gi 


13621622|gb 


AAK33414. 


ai 

y ■ 


13621555 


gb 


AAK33353.1 


gi|13621623|gb 


AAK33415. 


ai 

y 


1 3621 557|gb|AAK33355.1 


gi|13621624 


gb 


AAK33416. 


ai 

y 


1 3621 559|gb|AAK33356.1 


gi|1 3621 625 


gb 


AAK33417. 


ail 

y*i 


13621560|gb|AAK33357.1 1 gi|13621627|gb|AAK33419. 


ar 


13621561 


gb 


AAK33358.1 


| gi|13621629|gb 


AAK33420. 


ai 
y 


13621562 


gb 


AAK33359.1 


gi|13621630|gb 


AAK33421 . 


ai 

y 1 


13621563|gb 


AAK33360.1 


gi 


13621631 |gb 


AAK33422. 


ai 
y 1 


13621564 


gb 


AAK33361.1 


gi 


13621633 


gb 


AAK33424. 


ai 
y 1 


13621565 


gb 


AAK33362.1 


gi 


13621634 


gb 


AAK33425. 


ai 
y 1 


1 3621 566|gb|AAK33363.1 


gi 


13621636 


gb 


AAK33427. 


ai 

y • 


1 3621 567|gb|AAK33364.1 


gi 


13621637 


gb 


AAK33428. 


ai 
y 


1 3621 569|gbiAAK33365.1 


gi 


13621638 


gb|AAK33429. 


* ai 

y 


13621571 


gb 


AAK33367.1 


gi 


13621640 


gb|AAK33430. 


ai 

y ■ 


13621572 


gb 


AAK33368.1 


gi 


13621642 


gb|AAK33432. 


QI 

y 


13621573 


gb 


AAK33369.1 


gi 


13621644 


gb|AAK33434. 


QI 
y * 


13621574 


gb 


AAK33370.1 


gi 


13621645 


gb 


AAK33435. 


Qi 


13621575 


gb 


AAK33371.1 


gi 


13621647 


gb 


AAK33437. 


Qi 


13621576 


gb 


AAK33372.1 


gi 


13621648 


gb 


AAK33438. 


gi 

w 


13621577 


gb 


AAK33373.1 


g> 


13621650 


gb 


AAK33440. 


gi 


13621579 


gb|AAK33374.1 


gi 


13621651 


gb 


AAK33441. 


gi 


13621581 |gb 


AAK33376.1 


gi 


13621652 


gb 


AAK33442. 


gi 


13621 582|gb 


AAK33377.1 


I gi 


13621657 


gb 


AAK33446. 


gi 


13621583 


gb 


AAK33378.1 


I gi 


13621658 


gb 


AAK33447. 


gi 


13621584 


gb|AAK33379.1 


g» 


13621660|gb 


AAK33449. 


gi 


13621585 


gb 


AAK33380.1 


gi 


13621670 


gb 


AAK33458. 


gi 


13621586 


gb 


AAK33381.1 


gi 


13621671 


gb 


AAK33459. 


gi 


13621588 


gb 


AAK33383.1 


gi 


13621672 


gb|AAK33460. 


gi 


13621589 


gb 


AAK33384.1 


gi 


|13621675 


gb|AAK33462. 


gi 


13621590 


gb 


AAK33385.1 


gi 


13621676 


gb(AAK33463. 


gi 


13621592 


gb 


AAK33386.1 


gi 


1 3621 678|gb|AAK33465. 


gi 


13621593 


gb 


AAK33387.1 


gi 


13621680 


gb|AAK33467. 


gi 


13621594 


gb 


AAK33388.1 


gi 


13621681 


gb|AAK33468. 
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g> 


13621682 


gb 


AAK33469. 


gi 


13621683 


gb 


AAK33470. 


g> 


13621684 


gb 


AAK33471. 


g« 


13621685 


gb 


AAK33472. 




13621688 


gb 


AAK33474. 


gi 


13621689 


gb|AAK33475. 


gi 


13621690 


gb 


AAK33476. 


gi 


13621691 


gb 


AAK33477. 


gi 


13621692 


gb|AAK33478. 


gi 


13621693 


gb|AAK33479. 


gi 


13621694 


gb 


AAK33480. 


gi 


13621695 


gb 


AAK33481 . 


gi 


13621697 


gb 


AAK33483. 


gi 


13621698|gb 


AAK33484. 


gi 


13621700 


gb 


AAK33485. 


gi 


13621701 


gb 


AAK33486. 


gi 


1 3621 702|gb|AAK33487. 


gi 


13621714 


gb 


AAK33498. 


gi 


13621715 


gb 


AAK33499. 


gi 


13621717 


gb 


AAK33501 . 


gi 


13621718 


gb 


AAK33502. 


gi 


13621719 


gb 


AAK33503. 


gi 


13621720 


gb 


AAK33504. 


gi 


13621726 


gb 


AAK33509. 


gi 


13621727 


gb 


AAK33510. 


gi 


13621729 


gb 


AAK33512. 


gi 


13621730 


gb 


AAK33513. 


gi 


13621731 


gb 


AAK33514. 


gi 


13621732 


gb 


AAK33515. 


gi 


13621733 


gb 


AAK33516. 


gi 


13621734 


gb 


AAK33517. 


gi 


13621735 


gb 


AAK33518. 


gi 


13621736 


gb 


AAK33519. 


gi 


13621741 


gb 


AAK33523. 


gi 


13621742|gb 


AAK33524. 


gi 


13621743|gb 


AAK33525. 


gi 


13621744|gb 


AAK33526. 


gi 


13621745 


gb|AAK33527. 


gi 


13621747 


gb|AAK33528. 


gi 


13621756|gb 


AAK33537. 


gi 


13621773 


gb 


AAK33552. 


gi 


13621774 


gb 


AAK33553. 


gi 


13621775 


gb 


AAK33554. 


gi 


13621777 


gb 


AAK33556. 


gi 


13621778 


gb 


AAK33557. 


gi 


13621779 


gb 


AAK33558. 


gi 


13621781 


gb 


AAK33559. 


gi 


113621782 


gb 


AAK33560. 


gi 


|13621785 


gb 


AAK33563. 


gi 


13621786 


gb 


AAK33564. 


gi 


13621787 


gb 


AAK33565. 


gi 


13621788 


gb 


AAK33566. 


gi 


13621789 


gb 


AAK33567. 


gi 


13621790 


gb 


AAK33568. 


gi 


13621793 


gb 


AAK33571 . 


gi 


13621794 


gb 


AAK33572. 



gi|13621796 


gb|AAK33573. 


9' 


13621797 


gb|AAK33574. 


gi 


13621799 


gb 


AAK33576. 


gi 


13621800 


gb 


AAK33577. 


gi 1 13621 802 


gb 


AAK33579. 


gi 


13621806 


gb 


AAK33583. 


gi 


13621808 


gb 


AAK33584. 


gi 


13621809 


gb 


AAK33585. 


gi 


13621810 


gb|AAK33586. 


gi 


13621811 


gb 


AAK33587. 


gi 


13621812 


gb 


AAK33588. 


gi 


13621813 


gb 


AAK33589. 


gi 


1 3621 814|gb|AAK33590. 


gi 


1 3621 81 7|gb|AAK33592. 


gi 


13621818 


gb|AAK33593. 


gi 


13621819 


gb|AAK33594. 


gi 


1 3621 820|gb|AAK33595. 


gi 


13621821 


gb|AAK33596. 


gi 


13621822 


gb|AAK33597. 


gi 


13621823 


gb|AAK33598. 


gi 


13621824 


gb|AAK33599. 


gi 


13621825 


gb|AAK33600. 


gi 


13621826 


gb|AAK33601. 


gi 


13621828 


gb 


AAK33602. 


gi 


13621829 


gb 


AAK33603. 


gi 


13621830 


gb 


AAK33604. 


gi 


13621831 


gb 


AAK33605. 


gi 


13621834 


gb 


AAK33608. 


gi| 


13621835 


gb 


AAK33609. 


gi; 


13621836 


gb 


AAK33610. 


gi 


13621837 


gb 


AAK33611. 


gi! 


13621839 


gb|AAK33612. 


gi^ 


13621840 


gb|AAK33613. 


gi 1 


13621841 


gb|AAK33614. 


gi 


13621842 


gb|AAK33615. 


gi 


13621843 


gb 


AAK33616. 


gi 


13621844 


gb 


AAK33617. 


gi 


13621898 


gb|AAK33667. 


gi 


13621901 


gb|AAK33670. 


gi 


13621902 


gb|AAK33671. 


gi 


13621904 


gb|AAK33672. 


gi 


13621907 


gb 


AAK33675. 


gi 


13621908 


gb 


AAK33676. 


gi 


13621909 


gb 


AAK33677. 


gi 


13621910 


gb 


AAK33678. 


gi 


13621912 


gb 


AAK33680. 


gi 


13621924 


gb 


AAK33690. 


gi 


13621929 


gb 


AAK33694. 


g» 


13621930 


gb 


AAK33695. 


gi 


13621931 


gb 


AAK33696. 


gi 


13621933 


gb 


AAK33698. 


gi 


13621934 


gb 


AAK33699. 


gi 


13621935 


gb 


AAK33700. 


g» 


13621936 


gb 


AAK33701. 


gi 


13621937 


gb 


AAK33702. 


gi 


113621938 


gb 


AAK33703. 
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*j 


13621939 


gb 


AAK33704.1 


gi 


13621942 


gb 


AAK33706.1 


gi 1 


13621944 


gb|AAK33708.1 


gi 


13621945 


gb|AAK33709.1 


g> 


1 3621 946|gb|AAK3371 0.1 1 


gij 


1 3621 950|gb|AAK3371 4. 1 


g> 


13621953 


gb|AAK33716.1 


Qi 


13621954 


gb 


AAK33717.1 


Qi 


13621955 


gb 


AAK33718.1 


53* 


13621956 


gb 


AAK33719.1 


oi 


13621957 


gb|AAK33720.1| 


gi 


13621958 


gb 


AAK33721.1| 


gij 


13621959 


gb 


AAK33722.1| 


gi| 


13621961 


gb 


AAK33723.1| 


oil 


13621975 


gb 


AAK33736.1 


gi] 


13621977 


gb|AAK33738.1 


gi 


13621978 


gb|AAK33739.1 


Qi 

53" 


13621979 


gb|AAK33740.1 


gi 


13621980 


gb|AAK33741.1 


Qi 


13621981 


gb|AAK33742.1| 


Qi 


13621982 


gb 


AAK33743.1 


qi 


13621985 


gb 


AAK33745.1 


Qi 

53 


13621986 


gb 


AAK33746.1I 


Qi 


13621987 


gb 


AAK33747.1| 


Qi 

53* 


13621989|gb 


AAK33749.1 


Qi 

53* 


13621990|gb 


AAK33750.1| 


Qi 

53* 


13621992|gb 


AAK33752.1! 


gi 


13621993 gb 


AAK33753.1 


Qi 

53 


13621994 


gb|AAK33754.1 


gi 


13621996 


gb|AAK33755.1 


gi 


13621997 


gb 


AAK33756.1 


gi 


13621998|gb 


AAK33757.1 


Qi 

53* 


13621999 


gb 


AAK33758.1 


Qi 

53 


13622000 


gb 


AAK33759.1 


QI 

53 


13622001 


gb 


AAK33760.1 


Qi 

53' 


13622002 


gb 


AAK33761.1 


gi 

53' 


13622003 


gb 


AAK33762.1 


gi 

53 


13622004 


gb 


AAK33763.1 


g> 


13622005 


gb 


AAK33764.1 


gi 


13622006 


gb 


AAK33765.1 


gi 


1 3622008 


gb|AAK33766.1 


gi 


13622009 


gb 


AAK33767.1 


gi 


1 362201 0|gb 


AAK33768.1 


gi 


1 362201 2|gb 


AAK33770.1 


gi 


1 362201 3|gb|AAK33771 .1 


gi 


13622017 


gb 


AAK33774.1 


gi 


13622018 


gb 


AAK33775.1 


gi 


13622019 


gb 


AAK33776.1 


gi 


13622020 


gb 


AAK33777.1 


gi 


13622021 


gb 


AAK33778.1 


gi 


|13622024 


gb 


AAK33781.1 


gi 


|13622025 


gb 


AAK33782.1 


gi 


13622026 


gb|AAK33783.1 


gi 


13622031 


gb 


AAK33787.1 


gi 


1 3622032 


gb 


AAK33788.1 


gi 


1 3622033 


gb 


AAK33789.1 



gi 


13622034|gb 


AAK33790. 


gi 


13622035 


gb 


AAK33791. 


gi 


1 3622039 


gb 


AAK33794. 


gi 


13622041 


gb 


AAK33796. 


gi 

w I 


13622042 


gb 


AAK33797. 


gi| 


1 3622043|gb|AAK33798. 


gi 


1 3622044 


gb 


AAK33799. 


gi 


1 3622045 


gb 


AAK33800. 


gi 


13622046 


gb 


AAK33801. 


gi 


1 3622048 


gb 


AAK33802. 


gi 


13622049 


gb 


AAK33803. 


gi 


13622050 


gb 


AAK33804. 


gi 


13622051 


gb 


AAK33805. 


gi 


1 3622052 


gb 


AAK33806. 


gi 


13622054 


gb 


AAK33808. 


gi 


1 3622055 


gb 


AAK33809. 


gi 


1 3622056 


gb 


AAK33810. 


gi 

*w 


13622058 


gb 


AAK33812. 


gi 


1 3622060 


gb 


AAK33813. 


gi 


1 3622062 


gb|AAK33815. 


gi I 

i 


1 3622064 


gb 


AAK33817. 


gi] 


1 3622065 


gb 


AAK33818. 


gi I 


1 3622068 


gb 


AAK33821. 


gi 


1 3622069 


gb 


AAK33822. 


gi 


1 3622070 


gb 


AAK33823. 


gi 


13622071 


gb 


AAK33824. 


gi 

*^» 


1 3622073 


gb 


AAK33825. 


gi 


1 3622074 


gb 


AAK33826. 


gi 


1 3622075 


gb 


AAK33827. 


gi] 


13622077 


gb 


AAK33829. 


gi 


1 3622079 


gb 


AAK33831 . 


gi 


13622083 


gb 


AAK33834. 


gi 

*v^ 


1 3622085 


gb 


AAK33836. 


gi 


1 3622086 


gb 


AAK33837. 


gi 


1 3622087 


gb 


AAK33838. 


gi 


1 3622088 


gb|AAK33839. 


gi 


13622089 


gb|AAK33840. 


gi 


1 3622090 


gb 


AAK33841. 


gi 


13622091 


gb 


AAK33842. 


gi 


1 3622092 


gb|AAK33843. 


gi 


13622093 


gb|AAK33844. 


gi 


13622095 


gb 


AAK33845. 


gi 


1 3622096 


gb 


AAK33846. 


gi 


13622097 


gb 


AAK33847. 


gi 


13622162 


gb 


AAK33908. 


gi 


1 36221 63|gb|AAK33909. 


gi 


13622164 


gb|AAK33910. 


gi 


13622165 


gb|AAK33911. 


gi 


113622166 


gb 


AAK33912. 


gi 


13622169 


gb 


AAK33914. 


gi 


13622170 


gb 


AAK33915. 


gi 


13622171 


gb 


AAK33916. 


gi 


13622172 


gb 


AAK33917. 


gi 


13622174 


gb 


AAK33919. 


gi 


13622175 


gb 


AAK33920. 


gi 


13622176 


gb 


AAK33921. 
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13622177 


gb 


AAK33922.1 


gi 


13622269|gb|AAK34006. 


9' 


13622179 


gb 


AAK33923.1 


gi 


1 3622271 |gb|AAK34007. 




1 36221 80|gb 


AAK33924.1 


gi 


1 3622272|gb|AAK34008. 


gi 


13622181 


gb 


AAK33925.1 


gi 


1 3622273 |gb|AAK34009. 


gi 


13622182 


gb|AAK33926.1 


gi 


1 3622274|gb|AAK3401 0. 


gi 


13622183 


gb|AAK33927.1 


gi 

57 


13622275|gb|AAK34011. 




1 36221 84|gb|AAK33928.1 


gi 


1 3622276 


gb 


AAK34012. 


gi 


13622185 


gb|AAK33929.1 


gi 


1 3622277 


gb 


AAK34013. 


gi 

57 


13622186 


gb|AAK33930.1 


gi 


1 3622278 


gb 


AAK34014. 


gi 

57 


13622189 


gb|AAK33932.1 


gi 

57 


1 3622279 


gb 


AAK34015. 


gi 

57* 


1 3622 1 90 |g b | AAK33933 . 1 


gi 

57 


13622281 


gb|AAK34017. 


gi 

57 


13622191 


gb|AAK33934.1 


gi 

57" 


1 3622282 


gb|AAK34018. 


gi 

5? 


13622192 


gb|AAK33935.1 


gi 

57 


1 3622283 


gb 


AAK34019. 


qi 

57 


13622198 


gb|AAK33940.1 


gi 


1 3622284 


gb 


AAK34020. 


gi 

57" 


13622200|gb 


AAK33942.1 


gi 

57 


1 3622285 


gb 


AAK34021. 


gi 


13622201 


gb 


AAK33943.1 


gi 


1 3622287 


gb 


AAK34022. 


gi 


13622204 


gb 


AAK33946.1 


gi 


1 3622288 


gb 


AAK34023. 


gi 


13622205 


gb 


AAK33947.1 


gi 


1 3622289 


gb 


AAK34024. 


gi 

57 


13622207 


gb 


AAK33949.1 


gi 


1 3622290 


gb 


AAK34025. 


gi 

57 


13622208 


gb|AAK33950.1| 


gi 

57 


1 3622294 


gb 


AAK34029. 


gi 


13622211 


gb 


AAK33952.1 


gi 


1 3622295 


gb 


AAK34030. 


gi 


13622213 


gb 


AAK33954.1 


gi 


1 3622296 


gb 


AAK34031 . 


gi 


13622214 


gb 


AAK33955.1 


gi 


1 3622297 


gb 


AAK34032. 


gi 


13622215 


gb 


AAK33956.1 


gi 


1 3622298 


gb 


AAK34033. 


g« 


13622216 


gb 


AAK33957.1 


gi 


13622299 


gb 


AAK34034. 


gi 


13622217 


gb 


AAK33958.1 


gi 


13622301 


gb 


AAK34035. 


gi 


13622218 


gb 


AAK33959.1 


gi 


13622306 


gb 


AAK34040. 


gi 


13622219 


gb 


AAK33960.1 


gi 


13622326 


gb 


AAK34058. 


gi 


13622222|gb 


AAK33962.1 


gi 


1 3622328 |gb 


AAK34060. 


gi 


13622223 


gb 


AAK33963.1 


gi 


13622329 


gb|AAK34061 . 


gi 


13622224 


gb|AAK33964.1| 


gi 


1 3622330 


gb 


AAK34062. 


gi 


1 3622233 


gb 


AAK33972.1 


gi 


1 3622332 


gb 


AAK34064. 


gi 


13622235 


gb 


AAK33974.1 


gi 


1 3622333 


gb 


AAK34065. 


gi 


13622236 


gb 


AAK33975.1 


gi 


13622335|gb 


AAK34066. 


gi 


13622237 


gb|AAK33976.1 


gi 


1 3622338 


gb 


AAK34069. 


gi 


13622239 


gb|AAK33978.1 


gi 


1 3622339 


gb 


AAK34070. 


gi|1 3622240 


gb|AAK33979.1 


gi 


13622340 


gb 


AAK34071 . 


gi 


13622241 


gb|AAK33980.1 


gi 


13622341 


gb 


AAK34072. 


gi 


13622242 


gb|AAK33981.1 


gi 


13622343 


gb 


AAK34073. 


gill 3622243 


gbIAAK33982.1 


gi 


13622350 


gb 


AAK34080. 


gi 


1 3622244|gb|AAK33983. 1 


gi 


13622351 


gb|AAK34081. 


gi 


13622250|gbjAAK33988.1 


gi 


1 3622352 


gb|AAK34082. 


gi 


1 3622252 


gb|AAK33990.1 


gi 


13622353 


gb 


AAK34083. 


gi 


13622253 


gb|AAK33991.1 


gi 


1 3622355 


gb 


AAK34084. 


gi 


13622255 


gb|AAK33993.1 


gi 


1 3622356 


gb 


AAK34085. 


gi 


1 3622256 


gb|AAK33994.1 


gi 


13622357 


gb 


AAK34086. 


gi 


|13622257 


gb 


AAK33995.1 


gi 


13622358|gb 


AAK34087. 


gi 


13622259 


gb 


AAK33996.1 


I gi 


13622359 


gb 


AAK34088. 


gi 


1 3622260 


gb|AAK33997.1 


gi 


13622360 


gb 


AAK34089. 


gi 


13622261 


gb|AAK33998.1 


gi 


13622361 


gb 


AAK34090. 


gi 


113622262 


gb|AAK33999.1 


gi 


13622362 


gb 


AAK34091. 


gi 


|13622263|gb 


AAK34000.1 


gi 


1 3622363 


gb 


AAK34092. 


gi 


|13622264|gb 


AAK34001.1 


gi 


1 3622364 [gb 


AAK34093. 


gi 


13622265 


gb|AAK34002.1 


gi 


1 3622366 


gb 


AAK34094. 


gi 


1 3622266 


gb|AAK34003.1 


gi 


1 3622367 


gb 


AAK34095. 


gi 


|13622268 


gb|AAK34005.1 


gi 


1 3622368 


gb 


AAK34096. 
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1 3622369 


gb 


AAK34097.1 


gi 


13622370 


gb 


AAK34098.1 


gi 


13622371 


gb|AAK34099.1| 


gi 


13622372|gb|AAK34100.1 


gi 


1 3622373 


gb|AAK34101.1| 


gi 


13622374 


gb|AAK341 02.1| 


gi 

53 


13622375 


gb 


AAK341 03.1| 


gi 


1 3622376 


gb 


AAK34104.1 


Qi 


13622377 


gb 


AAK34105.1 


gi 


1 3622378 


gb 


AAK34106.1 


Qi 


13622380 


gb|AAK34107.1 


Qi 


1 3622383 


gb|AAK34110.1| 


Qi 

53 


13622384 


gb|AAK341 11.1| 


gi 


13622387|gb 


AAK34114.1] 


gi 


1 3622389 


gb 


AAK34116.1] 


gi 


1 3622394 


gb 


AAK34120.1| 


gi 


13622395 


gb 


AAK34121.1 


gi 


13622396 


gb 


AAK34122.1 


Qi 


13622398 


gb 


AAK34124.1 


Qi 

53 


13622399 


gb 


AAK34125.1| 


Qi 

53" 


13622400 


gb 


AAK34126.1] 


Qi 

53 


1 3622401 


gb 


AAK34127.1] 


gi 

53 


1 3622403 


gb 


AAK34128.1 


gi 


13622405 


gb 


AAK34130.1 


gi 


1 3622406 


gb 


AAK34131.lj 


gi 


1 3622407 |gb 


AAK34 132.1 


gi 


1 3622408 jgb 


AAK34133.1 


gi 

53 


1 362241 5|gb|AAK341 39.1 1 


gi 

53" 


13622416|gb|AAK34140.1| 


gi 

53" 


13622417|gb|AAK34141 .1 1 


Qi 

53 


13622419|gb|AAK34143.1| 


gi 


13622420 


gb 


AAK34144.1 


gi 


13622424 


gb 


AAK34147.1 


gi 


13622425 


gb 


AAK34148.1 


gi 


13622431 


gb 


AAK34153.1 




13622432 


gb 


AAK34154.1 


gi 


13622433|gb 


AAK34155.lj 


gi 


13622434 jgb 


AAK34156.1| 


gi 


13622435 jgb 


AAK34157.1 


gi 


13622436 


gb 


AAK34158.1 


gi 


13622437 


gb 


AAK34159.1| 


gj 


13622444 


gb 


AAK34165.1| 


gi 


1 3622447 


gb 


AAK34168.1 


gi 


13622450 


gb 


AAK34170.1 


gi 


13622451 


gb|AAK341 71.1| 


gi 


13622455|gb 


AAK34175.1| 


gi 


13622457jgb 


AAK34177.1| 


gi 


13622458jgb 


AAK34178.1 


gi 


13622460 jgb 


AAK34179.1 


gi 


13622461 


gb 


AAK34180.1 


gi 


13622462 


gb 


AAK34181.1 


g« 


1 3622463 


gb 


AAK34182.1 


gi 


1 3622464 


gb 


AAK34183.1 


gi 


1 3622465 


gb 


AAK34184.1| 


gi 


1 3622467 


gb 


AAK34186.1 


gi 


13622468 


gb 


AAK34187.1| 



gi 


113622471 


gb 


AAK34189.1 


gi 


113622473 


gb 


AAK34191.1 


gi 


1 3622474 


gb 


AAK34192.1 


gi 


13622477 


gb 


AAK34195.1 


gi 


1 3622478 


gb 


AAK34196.1| 


gi 


113622479 


gb 


AAK34197.1 


gi 


13622481 


gb 


AAK34198.1 


gi 


113622482 


gb 


AAK34199.1 


gi 


1 3622483 


gb 


AAK34200.1 


gi 


13622484 


gb 


AAK34201.1 


g» 


13622485 


gb|AAK34202.1 


gi 


113622486 


gb 


AAK34203 1 


gi 


13622491 


gb 


AAK34207 1 

m %# ml X*«^ 1 mm* \^ 1*1 


gi 


1 1 3622492 |gb 


AAK34208 1 


gi 


1 3622493 


gb 


AAK34209 1 


gi 


13622494 


gb 


AAK34210.1 

# ** » * ^ m Aril m • 9 


gi 


13622495 


gb 


AAK34211.1 


gi 


13622496 


gb 


AAK34212.1 


gi 


13622497 


gb 


AAK34213.1 


gi 


13622499 


gb 


AAK34214.1 


gi 


1 3622500 


gb 


AAK34215.1 


gi 


13622501 


gb 


AAK34216.1 


gi 


13622506 


gb 


AAK34221.1| 


gi 


1 3622507 


gb 


AAK34222.1| 


gi 


13622508 


gb 


AAK34223.1 


gi 


1 3622509 


gb 


AAK34224.1 


gi 


13622511 


gb 


AAK34225.1 


gi 


13622512 


gb 


AAK34226.1 


gi 


13622513 


gb 


AAK34227.1 


gi 


13622515 


gb 


AAK34229.1 


gi 


1 362251 6|gb 


AAK34230.1 


gi 


13622517 


gb 


AAK34231.1 


gi 


13622518 


gb 


AAK34232.1 


gi 


1 3622520 


gb 


AAK34233.1 


gi 


13622521 


gb 


AAK34234.1 


g» 


1 3622523 


gb 


AAK34236.1 


gi 


1 3622524 


gb 


AAK34237.1 


gi 


1 3622525 


gb 


AAK34238.1 


gi 


13622526 


gb 


AAK34239.1 


gi 


1 3622527 


gb 


AAK34240.1 


gi 


1 3622579|gb 


AAK34289.1 


gi 


13622583 


gb 


AAK34292.1 


gi 


13622585 


gb 


AAK34294.1 


gi 


1 3622587 


gb 


AAK34296.1 


gi 


1 3622588 


gb 


AAK34297.1 


gi 


13622590 


gb 


AAK34299.1| 


gi 


13622591 


gb 


AAK34300.1| 


gi 


13622593 


gb 


AAK34301.1| 


gi 


13622595 


gb 


AAK34303.1 


gi 


1 3622596 


gb 


AAK34304.1 


gi 


13622597 


gb 


AAK34305.1 


gi 


13622598 


gb 


AAK34306.1 


gi 


|13622599|gb 


AAK34307.1 


gi 


1 1 3622600 


gb 


AAK34308.1 


gi 


113622601 


gb 


AAK34309.1 


gi 


(13622603 


gb 


AAK34310.1 
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oil 

y 1 


1 3622604 


gb 


AAK34311.' 


ai 
y 1 1 


13622606 


gb 


AAK34313.* 


ai 
y 1 


13622607 


gb 


AAK34314.' 


ai 


13622608|gb 


AAK34315.' 


ai 


1 3622609|gb|AAK3431 6.' 


ai 
y* 


13622610 


gb|AAK34317.' 


ai 
y* 


13622611 


gb 


AAK34318 ' 


ai 
y 1 


13622612 


gb 


AAK34319 ' 


ai 
y» 


13622615 


gb 


AAK34321 ' 


ai 

y • 


13622616 


gb 


AAK34322.' 

/ v* vi w r • 


ai 
y' 


13622617 


gb 


AAK34323." 


ail 


13622618 


gb 


AAK34324.' 


ai 
y' 


13622621 


gb 


AAK34327 ' 


ai 
y' 


1 3622622 


gb 


AAK34328 ' 


ai 
y 1 


1 3622623 


gb 


AAK34329 ' 


ai 

y' 


1 3622624 


gb 


AAK34330 ' 


ail 

y*i 


13622625 


gb 


AAK34331.* 


ai 


13622626 


gb 


AAK34332 


ai 


1 3622628 |gb 


AAK34333 ' 


ail 
y'l 


1 3622629 


gb 


AAK34334 


ai 

y* 


13622630 


gb 


AAK34335 


ai 
y* 


13622631 


gb 


AAK34336 


ai 


13622632|gb 


AAK34337 


ai 

y'i 


13622634 


gb 


AAK34339 


ai 

y*, 


13622636 


gb 


AAK34341. 


q\ 

y 1 ; 


1 3622640 


gb 


AAK34344. 


ai 
y* 


13622641 


gb 


AAK34345. 


ai 
y* 


13622652|gb 


AAK34355. 


ai 
y* 


13622653|gb 


AAK34356. 


ai 


13622654|gb 


AAK34357. 


ai 


13622656|gb 


AAK34359. 


ai 


13622660 


gb 


AAK34363. 


ai 
y 1 


13622665 


gb 


AAK34367. 


ai 
y 1 


1 3622668 


gb 


AAK34370. 


ai 
y 1 


13622675 


gb 


AAK34376. 


ai 
y* 


1 3622676 |gb 


AAK34377. 


ai 
y 1 


113622683 


gb 


AAK34383. 


ai 

y * 


1 3622684 


gb 


AAK34384. 


ai 

53' 


13622685 


gb 


AAK34385. 


qi 


13622688 


gb 


AAK34387. 


ai 


13622689 


gb 


AAK34388. 


ai 

y • 


13622690 


gb 


AAK34389. 


9' 


13622691 


gb 


AAK34390. 


gi 


13622692|gb 


AAK34391. 


gi 


1 3622693 jgb 


AAK34392. 


gi 


13622694|gb|AAK34393. 


gi 


1 3622695 


gb 


AAK34394. 


gi 


1 3622696 


gb 


AAK34395. 


gi 


13622698 


gb 


AAK34396. 


gi 


13622699 


gb 


AAK34397. 


gi 


13622700 


gb|AAK34398. 


gi 


13622701 


gb|AAK34399. 


gi 


[13622702 


gb|AAK34400. 


gi 


113622703 


gb|AAK34401 . 


gi 


|13622704 


gb 


AAK34402. 


gi 


113622705 


gb 


AAK34403. 



gi 


13622711 


gb 


AAK34408. 


• 


13622713 


gb 


AAK34410. 


Qi 


1 362271 4|gb 


AAK3441 1 . 


Qi 


1 362271 5|gb 


AAK34412. 


Qi 

53 


1 362271 8|gb|AAK3441 4. 


oil 


1 362271 9|gb|AAK3441 5. 


QI | 


1 3622720|gb|AAK3441 6. 


Qi 

53' 


1 3622721 |gb|AAK3441 7. 


qi 

53* 


1 3622722 


gb 


AAK34418. 


qi 

53* 


1 3622723 


gb 


AAK34419. 


qi 

53' I 


13622727 


gb 


AAK34422. 


qi 

53* I 


1 3622728 


gb 


AAK34423. 


qil 

53' I 


13622729|gb 


AAK34424. 


ai 

53'! 


13622730|gb 


AAK34425. 


qi 

53' 


13622731 gb 


AAK34426. 


gi 


13622733 


gb 


AAK34428. 


gi 


1 3622734 


gb 


AAK34429. 


gi 


13622735 


gb 


AAK34430. 


gi| 


13622736|gb 


AAK34431. 


gi| 


13622737 


gb 


AAK34432. 


gi| 


1 3622740 


gb 


AAK34434. 


gi| 


13622741 


gb 


AAK34435. 


gi 


1 3622742 


gb 


AAK34436. 


gi 


1 3622744 


gb 


AAK34438. 


gi 


1 3622745 


gb 


AAK34439. 


gi 


1 3622746 


gb 


AAK34440. 


gj 


1 3622749 


gb 


AAK34442. 


gi 


1 3622750 


gb 


AAK34443. 


gi 


13622751 


gb|AAK34444. 


gi 


13622752 


gb|AAK34445. 


gi 1 13622753 


gb 


AAK34446. 


gi 1 13622754 


gb 


AAK34447. 


gi| 13622760 


gb 


AAK34452. 


gii 13622762 


gb 


AAK34454. 


gil 


1 3622763 


gb 


AAK34455. 


gi 


1 3622764 


gb 


AAK34456. 


g» 


1 3622765 


gb 


AAK34457. 


gi 


1 3622766 


gb 


AAK34458. 


gi 


13622767 


gb 


AAK34459. 


gi 


1 3622768 


gb 


AAK34460. 


gi 


1 3622770 


gb 


AAK34462. 


gi 


1 3622771 


gb 


AAK34463. 


gi 


1 3622774 


gb 


AAK34465. 


gi 


13622775 


gb 


AAK34466. 


gi 


1 3622776 


gb 


AAK34467. 


g« 


1 3622777 


gb 


AAK34468. 


gi 


13622778 


gb 


AAK34469. 


gi 


1 3622779 


gb 


AAK34470. 


gi 


1 3622780 


gb 


AAK34471. 


gi 


13622781 


gb 


AAK34472. 


gi 


1 3622782 


gb 


AAK34473. 


gi 


13622783 


gb 


AAK34474. 


gi 


|13622785 


gb 


AAK34475. 


gi 


1 3622787 


gb|AAK34477. 


gi 


13622789 


gb 


AAK34479. 


gi 


|13622790 


gb 


AAK34480. 
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gi 


13622791 |gb 


AAK34481.1 


g> 


13622792|gb 


AAK34482.1 


gi 


1 3622793 |gb 


AAK34483.1| 


gi 

57 


1 3622794 jgb 


AAK34484.1| 


oil 

57 1 


13622795 


gb|AAK34485.1| 


di 


1 3622796 


gb|AAK34486.1 


gi 

55" 


1 3622798 


gb 


AAK34487.1 


gi 

53" 


1 3622799 


gb 


AAK34488.1 


gi 

55* 


13622800 


gb 


AAK34489.1 


gi 


13622801 


gb 


AAK34490.1 


gi 

*W 


1 3622802 


gb 


AAK34491.1 


Qi 

55" 


13622803|gb 


AAK34492.1 


oil 

55*1 


13622804|gb 


AAK34493.1 


qi 

55 1 


13622805|gb 


AAK34494.1 


gi 


1 3622806 


gb|AAK34495.1| 


g' 


1 3622807 


gb 


AAK34496.1] 


gi 


13622808 


gb 


AAK34497.1 


gi 


13622809 


gb 


AAK34498.1 


gi 


13622810 


gb 


AAK34499.1 


gi 


13622812 


gb 


AAK34500.1| 


gi 


13622813 


gb 


AAK34501.1) 


gi 


13622814 


gb 


AAK34502.1 j 


gi 


1 362281 5|gb 


AAK34503.1 1 


gi 


1 362281 8|gb|AAK34506.1 1 


gi 

55 


13622821 


gb 


AAK34509.1| 


qi 

55 


1 3622822 


gb 


AAK34510.1| 


gi 


1 3622823 


gb 


AAK34511.1| 


gi 


13622825 


gb 


AAK34512.1 


qi 


1 3622826 |gb| AAK3451 3 . 1 


qi 

55 


1 3622827 


gb|AAK34514.1 


gi 

55* 


13622828 


gb|AAK34515.1 


qi 

55 


1 3622829 


gb|AAK34516.1 


gi 

55 


1 3622830 


gb 


AAK34517.1| 


qi 

55* 


1 3622833 |gb 


AAK34520.1 


gi 


13622838|gb 


AAK34524.1 


gi 

55 


|13622839|gb 


AAK34525.1 


gi 


|13622840|gb 


AAK34526.1 


gi 


13622841 


gb 


AAK34527.1 


gi 


13622847 


gb 


AAK34532.1 


gi 


13622848[gb 


AAK34533.1 


gi 


13622849|gb 


AAK34534.1 


gi 


1 3622853 


gb 


AAK34537.1 


gi 


1 3622854 


gb 


AAK34538.1 


gi 


1 3622856 


gb 


AAK34540.1 


gi 


1 3622857 


gb 


AAK34541.1 


gi 


1 3622858 


gb 


AAK34542.1| 


gi 


113622860 


gb 


AAK34543.1 


gi 


13622861 


|gb 


AAK34544.1 


gi 


1 3622862 


gb 


AAK34545.1 


gi 


1 3622863 


gb 


AAK34546.1 


gi 


1 3622864 


gb 


AAK34547.1 


gi 


1 3622865 


gb 


AAK34548.1 


gi 


1 3622866 


gb 


AAK34549.1 


gi 


1 3622867 


gb|AAK34550.1 


gi 


13622868|gb 


AAK34551.1 


gi 


|13622869|gb 


AAK34552.1 



9' 


13622870 


gb 


AAK34553. 


gi 


1 3622873 


gb 


AAK34555. 


gi 


13622875 


gb 


AAK34557. 


g> 


13622876|gb 


AAK34558. 


gi 


13622877|gb 


AAK34559. 


g> 


13622878 


gb 


AAK34560. 


gi 


1 3622879 


gb 


AAK34561 . 


gi 


13622880|gb 


AAK34562. 


gi 

55 


13622881 


gb 


AAK34563. 


gi 


13622882 


gb 


AAK34564. 


gi 

57 


13622885|gb 


AAK34566. 


gi| 


1 3622886 


gb 


AAK34567. 


gi 

55 I 


13622887 


gb|AAK34568. 


oil 

55 I 


13622888|gb|AAK34569. 


gi 

53 


13622890|gb 


AAK34571 . 


gi 


13622893|gb 


AAK34574. 


gi 


13622896 


gb 


AAK34576. 


gi 


13622898 


gb 


AAK34578. 


gi 


1 3622899 [gb 


AAK34579. 


gi 


13622900|gb 


AAK34580. 


gi 


13622901 Igb 


AAK34581. 


gi| 


13622903 


gb 


AAK34583. 


gi| 


13622905 


gb 


AAK34585. 


g'l 


13622906 


gb|AAK34586. 


oil 

C I 


13622907 


gb|AAK34587. 


gi 


13622908 


gb 


AAK34588. 


gi 


13622910 


gb 


AAK34589. 


g'l 


1 362291 1|gb 


AAK34590. 


gi| 


1 362291 2 jgb 


AAK34591 . 


gi| 


13622913 


gb|AAK34592. 


gi| 


13622914 


gb|AAK34593. 


qi 

57 


13622915 


gb|AAK34594. 


gi 


13622917 


gb|AAK34596. 


gi 


13622918 


gb 


AAK34597. 


gi 

55 1 


13622919 


gb 


AAK34598. 


gi! 


13622921 


gb 


AAK34599. 


gi 


1 3622922 


gb|AAK34600. 


gi 


13622924 


gb 


AAK34602. 


gi 

•w* 


13622925 


gb 


AAK34603. 


gi 


1 3622926 


gb 


AAK34604. 


gi 


13622927 


gb|AAK34605. 


gi 


13622928|gb 


AAK34606. 


gi 


13622929 


gb 


AAK34607. 


g> 


13622930 


gb 


AAK34608. 


gi 


13622931 


gb 


AAK34609. 


gi 


1 3622933 


gb|AAK34610. 


gi 


13622941 


gb|AAK34617. 


gi 


1 3622944 


gb 


AAK34620. 


gi 


13622945|gb 


AAK34621 . 


gi 


13622947|gb 


AAK34623. 


gi 


1 3622948 


gb 


AAK34624. 


gi 


1 3622949 


gb 


AAK34625. 


gi 


1 3622950 


gb 


AAK34626. 


gi 


13622952|gb 


AAK34627. 


gi 


13622955 


gb 


AAK34630. 


gi 


13622956 


gb 


AAK34631 . 
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w 


13622959|gb 


AAK34634.1] gi 


1 3623083 


gb 


AAK34746. 


qi 


1 3622961 Igb 


AAK34636.1| gi 


1 3623085 


gb 


AAK34747. 


Qi 

y 


13622963|gb|AAK34638.1 1 gi 


13623086 


gb 


AAK34748. 


Qi 

53' 


13622964|gb|AAK34639.1 j gi 


1 3623088 


gb 


AAK34750. 


gi 


13622967|gb 


AAK3 4641.1| gi 


13623089)gb|AAK34751 . 


Qi 


1 3622969 |gb 


AAK34643.1 


ai 

y 


13623090|gb 


AAK34752. 


gi 

53 


1 3622971 |gb|AAK34645.1 


ai 

y* 


13623091 


Igb 


AAK34753. 


Qi 


1 3622973|gbjAAK34647.1 


ai 


13623093 


gb 


AAK34755. 


Qi 

53' 


1 3622974|gb 


AAK34648.1 


ai 


1 3623095 


gb 


AAK34756. 


Qi 

53' 


13622977[gb 


AAK34651.1 


ai 
y- 


13623096 


gb 


AAK34757. 


Qi 

53 


13622981 igb 


AAK34654.1 


ai 
y- 


13623098 


gb 


AAK34759. 


ai 

53* 


13622982|gb 


AAK34655.1 


gi 


13623099 


gb 


AAK34760. 


ai 

v3' 


13622983|gb 


AAK34656.1 


gi 


13623100 


gb 


AAK34761. 


ai 

53 1 


13622984|gb 


AAK34657.1 


gi 


13623102 


gb 


AAK34763. 


Qi 
y 


1 3622985 


gb 


AAK34658.1 


gi 


13623103 


gb 


AAK34764. 


ai 

53' 


1 3622989 


gb 


AAK34661.1 


gi 


13623105 


gb 


AAK34766. 


gi 

53* 


1 3622990 


gb 


AAK34662.1 


gi 


13623107 


gb 


AAK34767. 


ai 

53' 


13622991 


gb 


AAK34663.1 


g> 


13623128 


gb 


AAK34787. 


ai 

y 


13622992|gb 


AAK34664.1 


gi 


13623129 


gb 


AAK34788. 


gi 

y 


1 1 3622995 |gb | AAK34666 . 1 


gi 


13623131 


gb|AAK34790. 


ai 

y 


1 3622996 


gb|AAK34667.1 


gi 


13623132 


gb 


AAK34791. 


ai 

y 


1 3622998 


gb|AAK34669.1 


g« 


13623133 


gb 


AAK34792. 


Qi 

53 


1 3622999 


gb|AAK34670.1 


gi 


13623134 


gb 


AAK34793. 


gi 

53 


1 3623000 


gb|AAK34671.1 


gi 


136231 36|gb|AAK34794. 


ai 

y 


13623001 


gb 


AAK34672.1 


gi 


13623138 


gb 


AAK34796. 


Qi 

53' 


13623002 


gb 


AAK34673.1 


gi 


13623139 


gb 


AAK34797. 


gi 

53 


1 3623004 


gb 


AAK34674.1 


g« 


13623150 


gb 


AAK34807. 


ai 

y 


1 3623005 


gb 


AAK34675.1 


gi 


13623151 


gb 


AAK34808. 


ai 

y 


13623006 


gb 


AAK34676.1 


gi 


13623152 


gb 


AAK34809. 


Qi' 

y 


13623007|gb 


AAK34677.1 


gi|1 36231 54 


gb|AAK3481 1 . 


ai 

y 


1 3623009 


gb 


AAK34679.1 


gi|1 36231 55 


gb 


AAK34812. 


Qi 

53 


13623019 


gb 


AAK34688.1 


gi 


13623156 


gb 


AAK34813. 


Qi 

53 ' 


1 3623020 


gb 


AAK34689.1 


gi 


136231 57|gb 


AAK34814. 


ai 

y 


13623030 


gb 


AAK34698.1 


gi 


1 36231 59|gb 


AAK3481 5. 


qi 

y 


13623031 


gb|AAK34699.1 


gi 


13623161] 


gb 


AAK34817. 


ai 

y 


13623032 


gb|AAK34700.1 


gi 


1 36231 62|gb 


AAK34818. 


qi 

53 


13623033 


gbiAAK34701.1 


gi|1 36231 63 


gb 


AAK34819. 


gi 

53 


13623038 


gb 


AAK34705.1 


gi 


13623165 


gb 


AAK34821. 


gi 

53 


13623045 


gb 


AAK34712.1 


gi 1 13623 166 


gb|AAK34822. 


gi 


13623046 


gb|AAK34713.1 


gi|13623167 


gb|AAK34823. 


gi 


13623047 


gb 


AAK34714.1 


gi 13623168 


gb 


AAK34824. 


gi 

53 


1 3623049 


gb 


AAK34715.1 


gi 


1 36231 70|gb 


AAK34826. 


gj 


1 3623050 


gb 


AAK34716.1 


gi 


13623171 


gb 


AAK34827. 


gi 


1 3623051 


gb 


AAK34717.1 


gi 


13623175 


gb 


AAK34830. 


gi 


1 3623052 


gb 


AAK34718.1 


gi 


13623176 


gb|AAK34831. 


gi 


1 3623053 


gb 


AAK34719.1 


I gi 


13623177 


gb|AAK34832. 


gi 


13623054 


gb 


AAK34720.1 


gi 


13623179 


gb 


AAK34834. 


gi 


1 3623056 


gb 


AAK34722.1 


gi 


1 36231 80|gb 


AAK34835. 


gi 


1 3623058 


gb 


AAK34724.1 


gi 


1 36231 82|gb 


AAK34836. 


gj 


1 3623062 


gb 


AAK34727.1 


gi 


1 36231 83|gb 


AAK34837. 


gi 


1 3623064 


gb 


AAK34729.1 


gi 


1 36231 84|gb 


AAK34838. 


gi 


1 3623065 


gb 


AAK34730.1 


gi 


1 36231 85|gb 


AAK34839. 


gi 


|1 3623069|gb|AAK34733.1 


gi 


1 36231 86|gb 


AAK34840. 


gi 


|13623074|gb 


AAK34738.1 


gi 


|13623187|gb 


AAK34841. 


gi 


1 3623081 


gb 


AAK34744.1 










gi 


1 3623082 


gb 


AAK34745.1 
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Table 34: List of GAS ORF's which 



ai 


13621381 |gb|AAK33195.' 


ai 


13621423|gb|AAK33233. 


ni 
y> 




gb|AAK33249." 


ni 


I OUf. I *T*TvJ 


gb|AAK33251. 


nil 




gb|AAK33260. 


ni 
y' 


A 'X&OAA^A 


gb 


AAK33261. 


ni 

y' 




gb 


AAK33284. 


ni 

y> 


J ODt 1 HO*£ 


gb 


AAK33287. 


ni 


I OO^C I Hc?<d. 


gb 


AAK33296. 


ni 

y* 




gb 


AAK33297. 


nii 

y'i 




gb 


AAK33300. 


nil 

y'l 




gb|AAK33301. 


nil 

y'i 


13621512 


gb|AAK33314. 


ni 

y' 


13621514 


gb 


rV-UxOOO I D. 


ni 
y' 


13621556 


gb 


AAK^W^A 
M/ArxOO O o*+ . 


ni 
y 1 


13621570 


gb 


/■\r\r\OOOUO. 


ai 
y 1 


13621587 


gb 


nMixOOOO^.. 


ni 

y* 


13621610 


gb 


nniNOO'rUO . 


ni 

y« 


13621613 


gb 


AAK33405 


ni 

y 1 


13621626 


gb 


AVAr\00*+ 1 o. 


ni 

y» 


13621632 


gb 


AAK33423. 


ai 
y 1 


13621635 


gb 


AAK33426. 


ai 
y 1 


13621643 


gb 


AAK33433. 


ni 
y' 


13621655 


gb 


AAK33444. 


ni 

yj 


13621656 


gb 


AAK33445. 


ni 

y> 


13621659 


gb 


AAK33448. 


ni 
y' 


13621673 


gb 


AAK33461 . 


ni 

y 1 


13621686|gb 


AAK33473. 


ni 

y» 


13621696|gb 


AAK33482. 


ni 

y 1 


13621703|gb 


AAK33488. 


ni 

y* 


13621712gb 


AAK33497. 


ai 


13621728 


gb 


AAK33511. 


ni 

y* 


13621738 


gb 


AAK33520. 


ai 
y* 


13621739 


gb 


AAK33521 . 


ni 

y> 


13621740|gb 


AAK33522. 


ni 

y 1 


13621 772 jgb 


AAK33551 . 


ni 

y 1 


113621776 


gb 


AAK33555. 


ni 

y* 


|13621791 


gb 


AAK33569. 


ni 

y 1 


113621798 


gb 


AAK33575. 


ai 

y» 


13621801 


gb 


AAK33578. 


ai 
y* 


13621803 


gb 


AAK33580. 


ai 
y 1 


113621804 


gb 


AAK33581 . 


ai 


13621832 


gb 


AAK33606. 


gi 


13621833 


gb 


|AAK33607. 


gi 


13621896|gb 


AAK33665. 


gi 


13621897 


gb 


AAK33666. 


gi 


13621906 


gb|AAK33674. 


gi 


13621911 


gb 


AAK33679. 


gi 


13621949 


gb 


AAK33713. 


gi 


13621951 


gb 


AAK33715. 


gi 


13621962 


gb 


AAK33724. 


gi 


13621963 


gb 


AAK33725. 


gi 


[13621964 


gb 


AAK33726. 


gi 


13621971 


gb 


AAK33732. 


gi 


13621976 


gb 


AAK33737. 


gi 


13621983 


gb|AAK33744. 
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shared with GBS but not with Spn 



ai 
y 1 


13621988 


gb 


AAK33748. 


ai 
y 1 


13622014 


gb 


AAK33772. 


ai 
y 1 


13622015 


gb 


AAK33773. 


ai 
y* 


13622022 


gb 


AAK33779. 


ai 
y 1 


13622023 


gb 


AAK33780. 


ni 
y 1 


13622028|gb 


AAK33784 


ai 
y 1 


1 3622029 


gb 


AAK33785. 


ni 
y 1 


13622037 


gb 


AAK33792. 

1 Vi VI \V f *— * 


ni 

y 1 


13622038 


gb 


AAK33793 

\vw f x#x^« 


ni 

y 1 


13622040 


gb 


AAK33795 


ni 

y 1 


13622057 


gb 


AAK3381 1 


ni 

y 1 


13622061 


gb 


AAK33814 


ni 


13622063|gb 


AAK3381 6 


ni 

y* 


1 3622066 


gb 


AAK3381 9 


ai 

y 1 


1 3622067 


gb 


AAK33820. 


ai 
y* 


1 3622076 


gb 


AAK33828. 


ai 
y* 


13622078 


gb 


AAK33830. 


gi 


13622084 


gb 


AAK33835. 


gi 


13622098 


gb 


AAK33848. 


gil 


13622099 


gb 


AAK33849. 


gi 


13622100 


gb 


AAK33850. 


gi 


13622104 


gb 


AAK33854. 


gi 


13622110 


gb 


AAK33859. 


gi 


13622116 


gb 


AAK33865. 


gi 


13622124 


gb 


AAK33873. 


gi 


13622159 


gb 


AAK33905. 


gi 


13622193 


gb 


AAK33936. 


gi 


13622194 


gb|AAK33937. 


gi 


13622195 


gb 


AAK33938. 


gi 


1 36221 96|gb 


AAK33939. 


gi|13622202|gb 


AAK33944. 


gi 


1 3622203 


gb 


AAK33945. 


gi 


1 3622206 


gb 


AAK33948. 


gi 


13622210 


gb 


AAK33951. 


g« 


1 3622221 


gb 


AAK33961. 


gi 


1 3622231 


gb 


AAK33971. 


gi 


13622234 


gb 


AAK33973. 


gi 


1 3622238 


gb 


AAK33977. 


gi 


13622245 


gb 


AAK33984. 


gi 


1 3622246 


gb|AAK33985. 


gi 


1 3622248 


gb 


AAK33986. 


gi 


1 3622249 


gb 


AAK33987. 


gi 


[13622251 


gb 


AAK33989. 


gi 


13622254 


gb 


AAK33992. 


gi 


1 3622267 


gb 


AAK34004. 


gi 


13622291 


gb 


AAK34026. 


gi 


|13622302 


gb 


AAK34036. 


gi 


1 3622303 


gb 


AAK34037. 


gi 


1 3622304 


gb 


AAK34038. 


gi 


13622327 


gb 


AAK34059. 


gi 


1 3622344 


gb 


AAK34074. 


gi 


|13622345 


gb 


AAK34075. 


gi 


1 3622346 


gb 


AAK34076. 


gi 


1 3622347 


gb 


AAK34077. 


gi 


1 3622348 


gb 


AAK34078. 


gi 


1 3622349 


gb 


AAK34079. 
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Table 34: List of GAS ORF's which are shared with GBS but not with Spn 



gi 


13622382|gb|AAK34109.1 


gi 


13622816 


gb 


AAK34504.1) 


gi 


1 3622386 


gb|AAK34113.1 


gi 


13622817 


gb 


AAK34505.1] 


gi 


13622391 


gb 


AAK34118.1 


gi 

53 


1 3622846 


gb 


AAK34531.1| 


gi 


1 3622392 


gb 


AAK34119.1 


I gi 

1 53* 


13622852 


gb 


AAK34536.1| 


gi 


1 3622397 


gb|AAK34123.1 


1 gi 

1 53" 


|13622874|gb 


AAK34556.1 


gi 


1 3622404 


gb|AAK34129.1 


1 gi 


|1 3622889|gb|AAK34570. 1 


gi 


(13622412 


gb)AAK34136.1 


1 gi 


[13622891 


gb|AAK34572.1| 


gi 


13622413 


gb 


AAK34137.1 


1 gi 


13622892 


gb|AAK34573.1| 


gi 


13622414 


gb 


AAK34 138.1 


gi 


13622897 


gb 


AAK34577.1 


gi 


13622418 


gb 


AAK34142.1 


gi 


1 3622902 


gb 


AAK34582.1 


gi 


13622430 


gb 


AAK34152.1 


gi 


113622904 


gb 


AAK34584.1 


gi| 13622446 


gb 


AAK34 167.1 


I gi 


|13622916|gb 


AAK34595.1 


Qi 


1 3622449 


gb 


AAK34 169.1 


gi 


13622923 


gb 


AAK34601.1 


Qi 

53 1 


13622453 


gb|AAK34173.1 


gi 


13622934 


gb 


AAK34611.1 


Qi 


13622470 


gb|AAK34188.1 


gi 


13622953|gb 


AAK34628.1 


Qi 


1 3622487 


gb 


AAK34204.1 


gi 


1 3622954 |gb 


AAK34629.1 


gi 

53 


1 3622490 


gb 


AAK34206.1 


gi 


13622960 


gb 


AAK34635.1 


Qi 

53 1 


1 3622502 


gb 


AAK34217.1 


gi 


1 3622968 


gb 


AAK34642.1 


gi 

53 


13622503|gb 


AAK34218.1 


| gi|13622980|gb 


AAK34653.1 


gi 


13622514 


gb|AAK34228.1 


gi 


13622987|gb 


AAK34659.1 


g> 


13622528 


gb 


AAK34241.1 


gi 


13623012 


gb 


AAK34682.1 


gi 


1 3622540 


gb 


AAK34252.1 


gi 


13623013 


gb 


AAK34683.1 


gi 


13622541 


gb 


AAK34253.1 


gi 


13623014 


gb 


AAK34684.1 


Qi 


1 3622544 


gb 


AAK34255.1 


gi 


13623015 


gb 


AAK34685.1 


gi 


1 3622545 


gb 


AAK34256.1 


. gi 


13623016 


gb 


AAK34686.1 


gi 


13622546 


gb 


AAK34257.1 


gi 


13623018 


gb 


AAK34687.1 


gi 


1 3622547 


gb 


AAK34258.1 


gi 


13623022 


gb 


AAK34691.1 


gi 


13622548 


gb|AAK34259.1 


gi 


13623029 


gb 


AAK34697.1 


gi 


13622550|gb|AAK34261 .1 


gi 


13623037 


gb|AAK34704.1 


gi 


13622551 


gb|AAK34262.1 


gi 


13623055 


gb|AAK34721.1| 


gi 


13622552 


gb 


AAK34263.1 


gi 


13623060 


gb 


AAK34725.1| 


gi 


1 3622556 


gb 


AAK34267.1 


gi 


13623061 


gb 


AAK34726.1| 


gi 


13622557 


gb 


AAK34268.1 


gi 


1 3623063 


gb 


AAK34728.1 


gi 


13622558 


gb 


AAK34269.1 


gi 


1 3623066 


gb 


AAK34731.1 


gi 


1 3622559 


gb 


AAK34270.1 


gi 


1 3623068 


gb 


AAK34732.1 


gi 


1 3622563 


gb 


AAK34273.1 


gi 


13623092 


gb 


AAK34754.1 


g< 


13622571 


gb 


AAK34281.1 


gi 


13623097|gb 


AAK34758.1 


gi 


13622576 


gb 


AAK34286.1 


gi 


136231 04 jgb 


AAK34765.1 


gi 


13622581 


gb 


AAK34290.1 


gi 


136231 26 |gb 


AAK34785.1 


gi 


13622582 


gb 


AAK34291.1 


gil 


13623130 


gb]AAK34789.1 


gi 


13622586 


gb 


AAK34295.1 


gi 


13623137 


gb 


AAK34795.1 


gi 


13622589 


gb|AAK34298.1 


gi 


13623153 


gb 


AAK34810.1 


gi 


1 3622605 


9b 


AAK34312.1] 


gi 


13623164 


gb 


AAK34820.1 


gi 


13622633|gb 


AAK34338.1j 


gi 


13623178 


gb 


AAK34833.1 


gi 


13622635|gb 


AAK34340.1| 








gi 


13622637|gb 


AAK34342.1| 








gi 


13622638|gb 


AAK34343.1| 








g> 


13622657|gb 


AAK34360.1 


- 








gi 


1 3622707 


gb 


AAK34404.1 










gi 


13622716 


gb 


AAK34413.1 










gi 


13622724 


gb 


AAK34420.1 










gi|13622732|gb 


AAK34427.1 










gi 


13622743|gb|AAK34437.1 1 










gi 


1 3622761 |gb 


AAK34453.1 










gi 


13622773|gb 


AAK34464.1 










gi 


13622788|gb 


AAK34478.1 
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Table 35: GAS ORFs which are shared with pneumococcus 

but not with GBS 



9" 


13621338 


gb 


AAK33157.1 


gi 


1 3623027 


gb 


AAK34695.1| 


gi 


13621352 


gb 


AAK33168.1 


gi 


13623087 


gb 


AAK34749.1| 


9' 


13621410 


gb 


AAK33221.1 


gi 


13623101 


gb 


AAK34762.1 


9' 


13621433 


gb 


AAK33242.1 


gi 


13623144 


gb 


AAK34802.1 


gi 


113621445 


gb 


AAK33253.1 


gi 


13623146 


gb 


AAK34804.1 


gi 


|13621446|gb 


AAK33254.1| gi 


13623147 


gb 


AAK34805.1| 


w 


13621447|gb 


AAK33255.1| 




gi 


13621448 


gb 


AAK33256.1 




gi 


13621449 


gb 


AAK33257.1 




gi 

57 


13621451 


gb 


AAK33259.1 




gi 


13621460 


gb 


AAK33267.1 




gi 


13621466 


gb 


AAK33272.1 




gi 


13621489 


gb 


AAK33293.1 




gi 

57 


13621490 


gb 


AAK33294.1 




gi 


13621519 


gb|AAK33320.1 


I 

1 


gi 

57 


13621520 


gb 


AAK33321.1 




gi 


13621653 


gb 


AAK33443.1 




gi 


13621722 


gb 


AAK33506.1 




gi 


13621723 


gb|AAK33507.1 




gi 


13621724 


gb 


AAK33508.1 


i 

1 


gi 


13621805 


gb 


AAK33582.1 


1 

1 


gi 


13621900 


gb 


AAK33669.1 


1 


gi 


13622011 


gb 


AAK33769.1 


1 

1 


gi 


13622212 


gb 


AAK33953.1 




gi 


13622280 


gb 


AAK34016.1 




gi 


13622381 


gb 


AAK34108.1 




gi 


13622409 


gb 


AAK34 134.1 




gi 


1 362241 0|gb 


AAK34135.1 




gi 


13622423 


gb 


AAK34146.1 




gi 

57 


1 3622428 


gb|AAK34151.1 




gi 


13622441 


gb 


AAK34162.1 




gi 


1 3622442 


gb 


AAK34 163.1 




gi 

57 


13622454 


gb 


AAK34 174.1 




gi 


1 3622456 


gb 


AAK34 176.1 




gi 


13622619 


gb 


AAK34325.1 


■ 


gi 


13622642 


gb 


AAK34346.1 




gi 


1 3622643 


gb 


AAK34347.1 




gi 


13622664|gb 


AAK34366.1| 




g> 


13622666 


gb 


AAK34368.1| 




gi 


13622667 


gb 


AAK34369.1 1 




gi 


13622671 


gb 


AAK34372.1 




gi 


1 3622672 


gb 


AAK34373.1 




gi 


13622673 


gb 


AAK34374.1 




gi 


13622674 


gb 


AAK34375.1 




gi 


13622679|gb|AAK34380.1 1 




gi 


1 3622680 


gb 


AAK34381.1| 




gi| 


1 3622682 


gb 


AAK34382.1 




gi| 


1 3622755 


gb|AAK34448.1 




gii 


13622758 


gb 


AAK34450.1 




gi| 


13622759 


gb 


AAK34451.1 




g'l 


13622835 


gb 


AAK34521.1 




g«l 


13622837 


gb 


AAK34523.1 




g>l 


13622937 


gb 


AAK34614.1] 




gi| 


13622942 


gb 


AAK34618.1] 




gi 


13622946 


gb 


AAK34622.1| 




g> 


13622978 


gb 


AAK34652.1| 
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Table 36: Spn ORF's are shared with GBS and GAS 



SP0001 


SP0158 


SP0254 


SP0385 


SP0002 


SP0173 


SP0259 


SP0386 


SP0003 


SP0179 


SP0261 


SP0387 


SP0004 


SP0180 


SP0262 


SP0400 


SP0005 


SP0184 


SP0263 


SP0401 


SP0006 


SP0185 


SP0264 


SP0402 


SP0007 


SP0186 


SP0265 


SP0403 


SP0008 


SP0187 


SP0266 


SP0404 


SP0010 


SP0189 


SP0268 


SP0405 


SP001 1 


SP0192 


SP0271 


SP0406 


SP0013 


SP0194 


SP0272 


SP0408 


SP0014 


SP0197 


SP0273 


SP041 0 


SP0019 


SP0199 


SP0274 


SP041 1 


SP0021 


SP0202 


SP0280 


SP041 2 


SP0024 


SP0204 


SP0281 


SP0415 


SP0027 


SP0205 


SP0282 


SP0416 


SP0032 


SP0208 


SP0283 


SP0417 


SP0033 


SP0209 


SP0284 


SP0418 


SP0034 


SP0210 


SP0285 


SP0419 


SP0035 


SP0211 


SP0286 


SP0420 


SP0036 


SP0212 


SP0287 


SP0421 


SP0037 


SP0213 


SP0289 


SP0422 


SP0042 


SP0214 


SP0290 


SP0423 


SP0044 


SP0215 


SP0291 


SP0424 


SP0045 


SP0216 


SP0292 


SP0425 


SP0046 


SP0217 


SP0294 


SP0426 


SP0047 


SP021 8 


SP0295 


SP0427 


SP0048 


SP0219 


SP0303 


SP0433 


SP0051 


SP0220 


SP0310 


SP0434 


SP0053 


SP0221 


SP0314 


SP0435 


SP0054 


SP0222 


SP0317 


SP0436 


SP0056 


SP0224 


SP0318 


SP0437 


SP0063 


SP0225 


SP0319 


SP0438 


SP0073 


SP0226 


SP0320 


SP0439 


SP0074 


SP0227 


SP0321 


SP0441 


SP0078 


SP0228 


SP0322 


SP0442 


SP0079 


SP0229 


SP0323 


SP0443 


SP0083 


SP0230 


SP0324 


SP0452 


SP0084 


* SP0231 


SP0325 


SP0453 


SP0085 


SP0232 


SP0327 


SP0454 


SP0095 


SP0233 


SP0330 


SP0457 


SP0105 


SP0234 


SP0334 


SP0458 


SP0106 


SP0235 


SP0336 


SP0459 


SP01 1 1 


SP0236 


SP0337 


SP0461 


SP0112 


SP0240 


SP0338 


SP0466 


SP01 1 8 


SP0242 


SP0340 


SP0467 


SP0120 


SP0243 


SP0342 


SP0474 


SP0121 


SP0245 


SP0369 


SP0477 


SP0122 


SP0246 


SP0370 


SP0478 


SP0127 


SP0247 


SP0371 


SP0483 


SP0128 


SP0248 


SP0373 


SP0486 


SP0129 


SP0249 


SP0374 


SP0488 


SP0148 


SP0250 


SP0381 


SP0489 


SP0149 


SP0251 


SP0382 


SP0493 


SP0151 


SP0252 


SP0383 


SP0494 


SP0152 


SP0253 


SP0384 


SP0499 
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Table 36: Spn ORFs are shared with GBS and GAS 



SP0500 


SP0652 


SP0787 


SP0895 


SP0501 


SP0657 


SP0788 


SP0896 


SP0502 


SP0660 


SP0792 


SP0897 


SP0515 


SP0662 


SP0793 


SP0904 


SP0516 


SP0663 


SP0797 


SP0905 


SP0517 


SP0665 


SP0798 


SP0908 


SP0519 


SP0668 


SP0799 


SP0909 


SP0521 


SP0669 


SP0801 


SP0912 


SP0522 


SP0671 


SP0802 


SP0923 


SP0523 


SP0672 


SP0803 


SP0927 


SP0526 


SP0673 


SP0805 


SP0928 


SP0549 


SP0674 


SP0806 


SP0929 


SP0550 


SP0675 


SP0807 


SP0931 


SP0552 


SP0676 


SP0816 


SP0932 


SP0553 


SP0678 


SP0817 


SP0933 


SP0554 


SP0680 


SP0820 


SP0935 


SP0555 


SP0681 


SP0822 


SP0936 


SP0556 


SP0687 


SP0823 


SP0937 


SP0557 


SP0688 


SP0824 


SP0938 


SP0563 


SP0689 


SP0825 


SP0943 


SP0567 


SP0690 


SP0828 


SP0944 


SP0568 


SP0701 


SP0829 


SP0945 


SP0576 


SP0702 


SP0831 


SP0946 


SP0577 


SP0709 


SP0835 


SP0947 


SP0578 


SP0713 


SP0837 


SP0948 


SP0579 


SP0726 


SP0838 


SP0954 


SP0581 


SP0727 


SP0839 
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Table 36: Spn ORF's are shared with GBS and GAS 
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Table 36: Spn ORF's are shared with GBS and GAS 
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Table 37: Spn ORF's which are shared with GBS but not with GAS 
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Table 38: Spn ORF's which are shared with GAS but no with GBS 
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Table 40: Comparative Sequences relating to SAG0635 



SEQ ID NO 4001 : SAGO 653 FROM THE 2603 V/R GBS TYPE V STRAIN 

ATGAAGAAAGTGTTAGTGAGTAGTCTTTTGGTTTTAGGGATTACGATA 
ACGTTACAAACAGTAGTTGAGGCTAAGGGGCCAAAAGTAGC 

ACTG CTCTTTCGGACACAAATAAAG ATAAAGT CACTACT ATTTCT ATTG ACG AG ATTCAA 
AAAAG CTTAGAAGGTAAGAAGC CXSATTACTGTTAGTTTTGATATTGATGATACACrrc CTT 
TTCAGTAGTGAATATTTTCAATATGGTAAAGAATATGTAACTCCT 

CTT CATAAACAAAAATT CTGGGAT CTT GT TG CAAAACGAGGAGAT CAAG ATTCCATT C C C 
AAAGAATATG CTAAAAAATTAATTG CT ATG CATCAAAAACGAGG AGATAAAATTGTTTTT 
ATAACAG GT AGGACAAGAGGGT CAATGTAT AAGGAGGG CG AGGTTGATAAAACAG CT AAA 
GC CTTAG CT AAAGAT TTTAAATTAGACAAACC AATTG CTGTAAATTATACAGG CG ATAAA 
CCTAAAAAGCCATACAAATATGATAAATCATATTATATTAAGAAATATGGTTCAGACATT 
CATTATGGAGATAGTGATG ACG AT ATTCATGCAG CTAGG G AGGC CGGTG CTAGAC CAATT 
AG AATTTTAAGAG CACCTAATT CT ACAAAT CT AC CTTTAC CAGAAG CTGG AGG CT ACGGT 
GAAGAGGTTCTCGAAAATTCAGCTTAC 

SEQ ID NO 4002 : SAGO 653 FROM THE 090 GBS TYPE III STRAIN 

AAGGGGCCAAAAGTAGCTTATACACAAGAGGGAATGAC 

TG CT CTTTCGGACACAAATAAAGATAAAGT CACTACTATTTCTATTGACG 

AGATT CAAAAAAG CTTAGAAGGTAAGAAG CCGATTACTGTTAGTTTTGAT 

ATTGATG ATACACTACT TTT CAGT AGT CAATATTTT CAATATGGT AAAGA 

ATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAAAATTCTGGG 

ATCTTGTTG CAAAACGAGGAGAT CAAG ATT C CATT C C CAAAG AATATG CT 

AAAAAATTAATTGCTATGCATCAAAAACGAGGAGATAAAATTGTTTTTAT 

AACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGGTTGATAAAA 

CAGCTAAAG CCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCTGTA 

AATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGATAAATCATA 

TTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATAGTGATGACG 

AT ATT CATG CAG CTAGG GAGGC CG GTG CTAGACCAAT TAGAATTTTAAG A 

GCACCTAATTCT ACAAAT CT AC CTTTAC CAGAAG CTG GAGGCTACGGTGA 

AG AGGTT CT C GAAAATT CAGCTTAC 

SEQ ID NO 4003 : SAGO 653 FROM THE A909 GBS TYPE la STRAIN 

AAGGGGCCAAAAGTAGCTTATACACA 

AGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTA 
TTT CTATTG ACG AG ATTCAAAAAAG CTTAGAAGGTAAGAAG CCG ATTACT 
GTTAGTTCTGATATTGATGATACACTGCTTCTC^GTAGTCAATATTTTCA 
ATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAAC 
AAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCC 
AAAGAATATG CTAAAAAATTAATTG CT ATG CAT CAAAAACGAGG AGATAA 
AATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCG 
AGGCTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAA 
CCAATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCa^TACAAATA 
TGATAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAG 
ATAGTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATT 
AGAATTTTAAGAG CACCTAATT CT ACAAAT CTAC CTTTACCAGAAG CTGG 
AGGCTACGGTGAAGAGGTTCTCGAAAATT CAGCTTAC 

SEQ ID NO 4004 : SAGO 65 3 FROM THE 18RS21 GBS TYPE II STRAIN 

AAGGG G C CAAAAGTAG CTT ATACACAAGA 

GGG AATGACTGCT CTTT CGGACACAAATAAAG AT AAAGTCACTACT ATTT 
CTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTT 
AGTTTTG ATATTGATGATACACTG CTTTT CAGTAGTCAATATTT T CAATA 
TGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAA 
AATTCTGGG ATCTTGTTG CAAAACG AGGAGATCAAGATT CCATTC C CAAA 
GAATATG CTAAAAAACT AATTG CTATG CAT CAAAAACGAGGAGATAAAAT 
TGTTTTTAT AACAGGTAGGACAAG AGGGT CAATGTATAAG GAGGG CGAGG 
TTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCA 
ATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGA 
TAAATC^TATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA 
GTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGA 
ATTTTAAGAG CACCTAATT CT ACAAAT CTAC CTTTAC CAGAAG CTGG AGG 
CTACGGTG AAGAGGTTCT CG AAAATT CAG CTT AC 

SEQ ID NO 4005 : SAGO 653 FROM THE M732 GBS TYPE III STRAIN 

AAGGGGC CAAAAGTAG CTTATACACAAG A 

GGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTT 
CTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGCCGATTACTGTT 
AGTTTTGATATTGATGATACACTGCTTTTC^GTAGTCAATATTrTCAATA 
TGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCATAAACAAA 
AACT CTG GGATCTTGTTGCAAAACGAGGAGATCAAG ATT C CATT C C CAAA 
GAATATG CTAAAAAATTAATTG CT ATG CAT CAAAAACGAGGAG ATAAAAT 
TGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGG 
TTG AT AAAACAG CTAAAG C CTTAG CTAAAG ATTTT AAATTAG ACAAAC CA 
ATTGCTGTAAATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGA 
TAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA 
GTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATTAGA 
ATTTTAAGAG CAC CT AATTCTACAAAT CTAC CTT TAC CAG AAGCTGG AG G 
CTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC 

SEQ ID NO 4006 : SAGO 65 3 FROM THE COH1 GBS TYPE III STRAIN 

AAGGGGCCAAAAGTAGCTTATACACAAGAGGGAATGACT 
GCTCTTTCGGACACAAATAAAGATAAAGTCACTACTATTTCTATTGACGA 
GATTCAAAAAAG CTTAG AAG GT AAGAAG C CGATT ACTGTTAGTTT TG AT A 
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TTGATGATACACTG CTTTT CAGTAGTCAATATTTT CAATATGGT AAAGAA 
TATGTAACT CCTGGATC GTTTG ATTTT CTT CAT AAACAAAAATT CTG GGA 
TCTTGTTG CAAAACG AGGAG ATCAAGATT CCATT C CCAAAGAATATG CTA 
AAAAATT AATTG CTATGCAT CAAAAACGAG GAGATAAAATTGTTTTTATA 
ACAGGTAGG ACAAGAGGGT CAATGTATAAGGAGGG CG AGGTTGATAAAAC 
AG CTAAAGCCTTAG CTAAAG ATTTTAAATTAGAGAA^CCAATTG CTGTAA 
ATTATACAGGCGATAAACCTAAAAAGCCATACAAATATGATAAATCATAT 
TATATTAAGAAATATGGTTCAGACATTCATTATGGAGATAGTGATGACGA 
TATTCATGCAG CTAGGGAG G CCGGTG CTAGACGAATTAGAATTTTAAGAG 
CACCTAATTCTACAAATCTACCTTTACCAGAAGCTGGAGGCTACGGTGAA 
GAGGTTCTCGAAAATTCAGCTTAC 

SEQ ID NO 4007 : SAG0653 FROM THE M781 GBS TYPE III STRAIN 

AAGGGGCCAAAAGTAGCTTATACACA 

AGAGGGAATGACTG CT CTTT CGGACACAAATAAAG ATAAAGT CACTACTA 

TTTCTATTG ACGAGATT CAAAAAAG CTTAGAAGGTAAGAAG CCGATTACT 

GTTAGTTTTG ATATTGATG ATACACTG CTTTT CAGTAGTCAATATTTT CA 

ATATGGTAAAGAATATGTAACTCCTGGAT CGTTTG ATTTT CTTCATAAAC 

AAAAATT CTGGGATCTTGTTGCAAAACGAGGAGATOXAGATTCCATTCCC 

AAAGAAT ATG CTAAAAAATTAATTG CT ATG CATCAAAAACGAGGAGATAA 

A^TTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCG 

AGGTTGATAAAACAGCTAA^GCCTTAGCTAAAGATTT^ 

CCAATTG CTGTAAATTATACAGG CG ATAAACCTAAAAAGCCATACAAATA 

TGATAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAG 

ATAGTGATGACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCAATT 

AGAATTTTAAGAGC^CCTAATTCTACAAATCTACCTTTACCAGAAGCTGG 

AGGCTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC 

SEQ ID NO 4008 : SAGO 653 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

AAGGGGCCAAAAGTAGCTTATACACAAGA 

GGGAATG ACT G CT CTTTCGG ACACAAATAAAGATAAAGTCACTACT ATTT 
CTATTGACGAGATT CAAAAAAG CTT AG AAGGTAAGAAG CCGATT ACTGTT 
AGTTTTGATATTGATGAT ACACTGCTTTT CAGTAGT CAATATTTT CAATA 
TGGT AAAGAATATGT AACT C CTGGATCGTTTG ATTTT CTT CATAAACAAA 
AATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCATTCCCA 
GAATATG CTAAAAAATTAATTG CTATG CATCAAAAACGAGGAGATAAAAT 
TGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAGGGCGAGG 
TTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCA 
ATTG CTGTAAATTAT ACAGG CGATAAACCT AAAAAGC CAT ACAAATATGA 
TAAATCATATTATATTAAGAAATATGGTTCAGACATTCATTATGGAGATA 
GTGATGACG ATATT CATG CAGCTAGGGAG G CCGGTGCTAGACCAATTAGA 
;VTTTT AAGAG CACCTAATT CTACAAAT CTAC CTTTACCAGAAG CTGG AGG 
CTACGGTGAAGAGGTTCTCGAAAATTCAGCTTAC 

SEQ ID NO 4009 : SAGO 6 53 FROM THE JM9130013 GBS TYPE VIII STRAIN 

AAGG GGC CAAAAGTAGCTTATACACAAGAGGG AAT 

GACTG CT CTT TCGGACACAAAT AAAGATAAAGTGACT^ 

ACGAG ATTCAAAAAAGCTTAGAAGGTAAG AAG CCGATTACTGTTAGT TTT 

GATATTGATGATACA CTG CTTTT CAGTAGT CAATATTTT CAATATGGTAA 

AGAATATGTAACTCCTGGATCGT1^C^TTTTCTTCA.TAAACAAAAATTCT 

GG GAT CTTGTTG CAAAACGAGGAGAT CAAG ATTC CATT CC CAAAGAATAT 

GCTAAAAAATTAATTGCTATGC^TCAAAAACGAGGAGATAAAATTGTTTT 

TATAACAGGTAGGACAAGAGGGT CAATGT ATAAGGAG GG CGAGGTTG ATA 

AAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTAGACAAACCAATTGCT 

GTAAATTATACAGG CGATAAAC CTAAAAAG CCAT ACAAAT ATGATAAAT C 

ATATTATATT AAGAAATATG GTT CAGACATT CATTATGG AGATAGTGATG 

ACGATATTCATGCAGCTAGGGAGGCCGGTGCTAGACCMTTAGAATTTTA 

AGAG CACCTAATTCTACAAATCTAC CTTTACCAGAAG CTGGAGG CTA CGG 

TGAAGAGGTTCTCGAAAATTCAGCTTAC 
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PRETTY of: /biottnp/msa20031.2{*} August 5, 2002 07:05 



msa2 0031 .2{l00_18RS2l} 
msa20031.2{l00_2603} 
msa2 0031.2{l00_A909} 
msa2 0031 . 2 { 100_CJB110 } 
msa20031.2{l00_COHl} 
msa20O31.2{l0O_JM9130013} 
msa20031.2{l00_M732} 
msa20031.2{l00 M781} 
msa2 0031 .2{l00_090} 
Consensus 



AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 

AAGGGGCCAA 
********** 



AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 

AAGTAGCTTA 
********** 



TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 

TACACAAGAG 
********** 



GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 
GGAATGACTG 



50 

CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 
CTCTTTCGGA 



********** ********** 



msa2 0031 . 2 { 100_18RS21 } 
msa20031. 2 {100^2603} 
msa20031.2{l00_A909} 
msa20031 . 2 { 100_CJB110 } 
msa2 0 03 1 . 2 { 10 0_COH1 } 
msa20031 . 2 { 100_JM9130013 } 
msa20031 .2{100_M732} 
msa20031.2{lOO_M78l} 
msa2 0031 -2{l00_090} 
Consensus 



51 

CACAAATAAA 
CACAAATAAA 
CACAAATAAA 
CACAAATAAA 
CACAAATAAA 
CACAAATAAA 
CACAAATAAA' 
CACAAATAAA 
CACAAATAAA 
********** 



GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 

GATAAAGTCA 
********** 



CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 

CTACTATTTC 
********** 



TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 

TATTGACGAG 
********** 



100 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 

ATTCAAAAAA 
********** 



msa2OO31.2{l00_18RS2l} 
msa20031 .2{ 100^2603} 
msa20031 .2{100_A909} 
msa20031 . 2 { 100_CJB110 } 
msa2 0031.2(10 0_COH1 } 
msa20031 . 2 {l00_JM9130013 } 
msa20031.2{l00_M732} 
msa20031.2{lOO_M78l} 
msa20031.2{l00_090} 
Consensus 



101 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 

GCTTAGAAGG 
********** 



TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 

TAAGAAGCCG 
********** 



ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 

ATTACTGTTA 
********** 



GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 
GTTTTGATAT 



150 

TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 
TGATGATACA 



********** ********** 



msa20031 .2 { 
msa20031 . 
msa20031 . 
msa20031.2 { 
msa20031 . 
msa20031.2 {100 
msa20031 . 
msa20031 . 
msa20031 



100_18RS2l} 
2{100_2603} 
2{100__A909} 
100_CJB110} 
2{100_COH1} 
_JM9130013} 
2{100__M732} 
2{100_M781} 
.2{100_090} 
Consensus 



151 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTgCTTTTCA 

CTaCTTTTCA 
**_******* 



GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 

GTAGTCAATA 
********** 



TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 
TTTTCAATAT 



GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 
GGTAAAGAAT 



200 

ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 
ATGTAACTCC 



********** ********** ********** 



msa20031.2{!00_18RS2l} 
msa20031 . 2 {100_2603 } 
msa2003 1 . 2 { 100_A909 } 
msa2 0031.2{lOO_CJB110} 
msa20031-2{l00_COHl} 
msa20031. 2 { 100_JM9130013 } 
msa20031 . 2 { 100J4732 } 
msa2003 1 . 2 { 100_M781 J 
msa20031 . 2 { 100_090 } 
Consensus 



201 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 

TGGATCGTTT 
********** 



GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
GATTTTCTTC 
********** 



ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 

ATAAACAAAA 
********** 



ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 

ATTCTGGGAT 
********** 



250 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 

CTTGTTGCAA 
********** 



rnsa20031.2{l00_18RS21 
msa2003 1 . 2 { 100_2603 
msa2003 1 . 2 { 100_A909 
msa20031 . 2 { 100_CJB110 } 
msa2 003 1 . 2 { 100_COH1 } 
msa20031.2{lOO_JM9130013* 
msa20031 .2{100_M732 
msa20031.2{l00_M781 
msa20031.2{l00_090} 
Consensus 



251 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 

AACGAGGAGA 
********** 



TCAAGATTCC 

TCAAGATTC C 

TCAAGATTCC 

TCAAGATTCC 

TCAAGATTCC 

TCAAGATTCC 

TCAAGATTCC 

TCAAGATTCC 

TCAAGATTCC 
********** 



ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 

ATTCCCAAAG 
********** 



AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 

AATATGCTAA 
********** 



300 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 

AAAATTAATT 
********** 



msa20031 . 2 { 100_18RS21 } 
msa20031 . 2 { 100_2603 } 
msa20031 .2{100_A909 
msa20031 . 2 { 100_CJB110 
msa20031 . 2 { 100_COH1 
msa2O031.2{l0O_JM9130013 
msa20031 .2{100_M732; 
msa20031 .2{100_M781; 



301 

GCTATGCATC 
GCTATGCATC 
GCTATGCATC 
GCTATGCATC 
GCTATGCATC 
GCTATGCATC 
GCTATGCATC 
GCTATGCATC 



AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 
AAAAACGAGG 



AGATAAAATT 
AGATAAAATT 
AGATAAAATT 
AGATAAAATT 
AGATAAAATT 
AGATAAAATT 
AGATAAAATT 
AGATAAAATT 



GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 
GTTTTTATAA 



350 

CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
CAGGTAGGAC 
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msa20031 .2{lOO_090} 
Consensus 



GCTATGCATC AAAAACGAGG AGATAAAATT GTTTTTATAA CAGGTAGGAC 
********** ********** ********** ********** ********** 



msa20031 .2{l00_18RS2l} 
msa20031.2{l00_2603} 
msa20031. 2{100_A909} 
msa20031 .2 { 100_CJB110} 
msa2 0 03 1 . 2 { 10 0_COH1 } 
msa20031.2{lOO_JM9130013} 
msa20031. 2{100_M732} 
msa20031.2{lOO_M78l}- 
msa20031.2{l00_090} 
Consensus 



351 

AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
AAGAGGGTCA 
********** 



ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 

ATGTATAAGG 
********** 



AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 

AGGGCGAGGT 
********** 



TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 

TGATAAAACA 
********** 



400 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 

GCTAAAGCCT 
********** 



msa20031 ,2{ 
msa20031 . 
msa20031 . 
msa20031 .2{ 
msa20031 . 
msa20031.2{l00 
msa20031 . 
msa20031 . 
msa20031 



100_18RS2l} 
2{100_2603} 
2{100_A909} 
100_CJB110} 
2{100_COH1} 
_JM9130013} 
2{100_M732} 
2{100_M781} 
2{l00_090} 
Consensus 



401 

TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
TAGCTAAAGA 
********** 



TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 

TTTTAAATTA 
********** 



GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 
GACAAACCAA 



TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 
TTGCTGTAAA 



********** ********** 



450 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 

TTATACAGGC 
********** 



msa20031 .2{ 
msa20031 . 
msa20031. 
msa20031 .2{ 
msa20031 . 
msa20031.2{l00 
msa20031 . 
msa20031 . 
msa20031 



100_18RS2l} 
2{100_2603} 
2{100_A909} 
100_CJB110' 
2{100_COH1 
_JM9130013 
2{l00_M732} 
2{100_M781) 
.2{l00_090} 
Consensus 



451 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 

GATAAACCTA 
********** 



AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 

AAAAGCCATA 
********* * 



CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 

CAAATATGAT 
********** 



AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 

AAATCATATT 
********** 



500 

ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
ATATTAAGAA 
********** 



msa20031 .2{ 
msa20031. 
msa20031 . 
msa20031 -2{ 
msa20031. 
msa20031.2{l00 
maa20031. 
msa20031. 
msa20031 



100_18RS2l} 
2 {100^2603} 
2{100_A909} 
100_CJB110} 
2{100_COH1} 
_JM9130013) 
2{100_M732" 
2{l00_M781 
-2{100_090; 
Consensus 



501 

ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 
ATATGGTTCA 



GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 
GACATTCATT 



********** ********** 



ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 

ATGGAGATAG 
********** 



TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 

TGATGACGAT 
********** 



550 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 

ATTCATGCAG 
********** 



msa20031.2{l00_18RS2l} 
msa20031 . 2 {l00_2603 } 
msa2 0031 . 2 { 100_A909 } 
msa2 0031 . 2 { 100_CJB110 } 
msa2 003 1 . 2 { 100_COH1 } 
msa20031-2{l00_JM9130013" 
msa20031. 2{100_M732 
msa2 0031. 2{100__M781 
msa20031. 2 { 100_090} 
Consensus 



551 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 

CTAGGGAGGC 
********** 



CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 

CGGTGCTAGA 
********** 



CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 

CCAATTAGAA 
********** 



TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 

TTTTAAGAGC 
********** 



600 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 

ACCTAATTCT 
********** 



msa20031 . 2{ 
msa20031 . 
msa20031 . 
msa20031 - 2{ 
msa20031. 
msa20031.2{l00 
msa20031 . 
msa20031. 
msa20031 



100_18RS21 
2{100_2603" 
2{100_A909 
100_CJB110 
2{100__COH1 
_JM9130013 
2{100_M732 
2{100_M781] 
2{100_090] 
Consensus 



601 

ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 
ACAAATCTAC 



CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 
CTTTACCAGA 



AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 
AGCTGGAGGC 



TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 
TACGGTGAAG 



650 

AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 
AGGTTCTCGA 



********** ********** ********** ********** ********** 



msa20031 . 2 { 100_18RS21 } 
msa20031. 2f lb"0_2603* 
msa20031. 2{100_A909 

msa20031.2{l00_CJB110 
msa20031. 2{l00_COHl} 
msa20031.2{lOO_JM9130013l 
msa2 0031 . 2 { 100_M732 } 



651 663 

AAATTCAGCT TAC 

AAATTCAGCT TAC 

AAATTCAGCT TAC 

AAATTCAGCT TAC 

AAATTCAGCT TAC 

AAATTCAGCT TAC 

AAATTCAGCT TAC 
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msa20031.2{l00_M78l} AAATTCAGCT TAC 
msa20031.2{l00_090} AAATTCAGCT TAC 
Consensus ********** *** 

SEQ ID NO 4010 : SAGO 653 PROM THE 2603 V/R GBS TYPE V STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTI S IDE IQKSLEGKKPITVSFDI DDTLLFSSQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGS 
MYKEGEVDKTAKAIAKDFKLDKPI AVNYTGDKPKKPYKYDKS 
IHAAREAGARPI RI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4011 : SAGO 653 FROM THE 090 GBS TYPE III STRAIN 

KGPKVAYTQEGMTAIiSDTNKDKVTT I S IDE IQKSLEGKKPITVSFDI DDTLLFSSQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VFITGRTRGS 
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYIKKYGSDIHYGDSDDD 
IHAAREAGARPIRI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4012 : SAGO 6 53 FROM THE A909 GBS TYPE la STRAIN 
KGPKVAYTQEGMTAL SDTNKDKVTT I S I DE I QKSLEGKKP I TVS FD I DDTLLFS S QYFQ Y 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VFITGRTRGS 
MY KEGEVDKTAKALAKDFKLDKPI AVNYTGDKPKKPYKYDKS YYI KK^ 
IHAAREAGARPIRI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4013 : SAGO 6 53 FROM THE 18RS21 GBS TYPE II STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTI S IDE I QKSLEGKKP I TVS FDIDDTLLFS SQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VFITGRTRGS 
MYKEGEVDKTAKALAKDFKLDKPIAVNYTGDKPKKPYKYDKSYYI KKYGSD I HYGDSDDD 
IHAAREAGARPIRI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4014 : SAGO 653 FROM THE COH1 GBS TYPE III STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTI S I DE I QKSLEGKKP ITVS FDIDDTLLFS SQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VFITGRTRGS 
MYKEGEVDKTAKALAKD FKLDKP I AVNYTGDKPKKPYKYDKS YYI KKYGSD I HYGDSDDD 
IHAAREAGARPI RILRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4015 : SAGO 653 FROM THE M781 GBS TYPE III STRAIN 

KGPKVAYTQEGMTALSDTNKDK^/TT IS IDE I QKSLEGKKP ITVS FDIDDTLLFSSQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKL I AMHQKRGDKI VF ITGRTRGS 
MYKEGEVDKTAKALAKD FKLDKP I AVNYTGDKPKKPYKYDKS YY I KKYGSDI HYGDSDDD 
IHAAREAGARPI RI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4016 : SAG0653 FROM THE CJB110 GBS NONTYPEABLE STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQY 
GKEYVTPGS FDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VF ITGRTRGS 
MYKEGEVDKTAKALAKDFKLDKPI AVNYTGDKPKKPYKYDKS YY I KKYGSD I HYGDSDDD 
IHAAREAGARPI RI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4017 : SAGO 653 FROM THE JM9130013 GBS TYPE VIII STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTT I S IDE I QKSLEGKKPITVSFD I DDTLLFSSQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VF ITGRTRGS 
MYKEGEVDKTAKALAKD FKLDKP I AVNYTGDKPKKPYKYDKS YYI KKYGSD 
IHAAREAGARPI RI LRAPNSTNLPLPEAGGYGEEVLENSAY 

SEQ ID NO 4018 : SAG0653 FROM THE M732 GBS TYPE III STRAIN 

KGPKVAYTQEGMTALSDTNKDKVTTI SIDE I QKSLEGKKPITVSFD I DDTLLFSSQYFQY 
GKEYVTPGSFDFLHKQKFWDLVAKRGDQDS I PKEYAKKLI AMHQKRGDKI VF ITGRTRGS 
MYKEGEVDKTAKALAKD FKLDKPIAWYTGDKPKKPYKYDKS YYI KKYGSDIHYGDSDDD 
IHAAREAGARPI RI LRAPNSTNLPLPEAGGYGEEVLENSAY 
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PRETTY of: /biotmp/msa25122 . 2 { * } August 5, 2002 07:09 



, msa25122 .2{l00__090} 
msa25122 . 2 { 100_18RS21 } 
msa25122.2{l00__2603} 
msa25122 . 2 {100_A909 } 
msa25122.2{lOO_CJB110} 
msa25122 .2{l00_COHl} 
msa25122 . 2 {100_JM9130013 } 
msa25122 .2{l00_M732} 
msa25122.2{l00_M78l} 
Consensus 



KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 
KGPKVAYTQE 



GMTALSDTNK 
GMTALS DTNK 
GMTALSDTNK 
GMTALSDTNK 
GMTALSDTNK 
GMTALSDTNK 
GMTALSDTNK 
GMTALSDTNK 
GMTALSDTNK 



DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 
DKVTTISIDE 



IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 
IQKSLEGKKP 



********** ********** ********** ********** 



50 

I TVS FD I DDT 

ITVSPDIDDT 

ITVSFDIDDT 

ITVSPDIDDT 

ITVSFDIDDT 

ITVSFDIDDT 

ITVSFDIDDT 

ITVSFDIDDT 

ITVSFDIDDT 
********** 



msa25122 
msa25122 . 2{ 
msa25122 - 
msa25122 . 
msa25122 . 2{ 
msa25122 . 
msa25122 . 2 {100 
msa25122 . 
msa25122 . 



.2{100_090 
100_18RS21 
2{100_2603} 
2{100_A909} 
100_CJB110j 
2{100_COH1} 
_JM9130013} 
2{l00_M732} 
2{100_M781} 
Consensus 



51 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 

LLFSSQYFQY 
********** 



GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 

GKEYVTPGSF 
********** 



DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 
DFLHKQKFWD 



LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 
LVAKRGDQDS 



********** ********** 



100 

IPKEYAKKLI 

I PKE YAKKL I 

IPKEYAKKLI 

IPKEYAKKLI 

IPKEYAKKLI 

IPKEYAKKLI 

IPKEYAKKLI 

IPKEYAKKLI 

IPKEYAKKLI 
********** 



msa25122 
msa25122 . 2 { 
msa25122 . 
msa25122 . 
msa25122 ,2{ 
msa25122 . 
msa25122 . 2{l00 
msa25122 . 
msa25122 . 



.2{l00_090 
100 18RS21 
2{100_2603 
2{100_A909} 
100_CJB110~ 
2{100_COH1 
_JM9130013 
2{100_M732} 
2{100_M781} 
Consensus 



101 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 

AMHQKRGDKI 
********** 



VF I TGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 

VFITGRTRGS 
********** 



MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
MYKEGEVDKT 
********** 



AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 

AKALAKDFKL 
********** 



150 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 

DKPIAVNYTG 
********** 



msa25122 
msa25122 .2{ 
msa25122 . 
msa25122 . 
msa25122.2{ 
msa25122 . 
msa25122 . 2{l00 
msa25122 . 
msa25122 . 



.2{100_090; 
100_18RS21; 
2{100_2603 
2{100_A909 
100_CJB110 
2{l00_COHi; 
_JM9130013] 
2{100_M732 
2{100_M781 
Consensus 



151 

DKP KKPYKYD 

DKPKKPYKYD 

DKP KKPYKYD 

DKPKKPYKYD 

DKPKKPYKYD 

DKPKKPYKYD 

DKPKKPYKYD 

DKPKKPYKYD 

DKPKKPYKYD 
********** 



KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 

KSYYIKKYGS 
********** 



DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 

DIHYGDSDDD 
********** 



IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 
IHAAREAGAR 



200 

PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 
PIRILRAPNS 



********** ********** 



msa25122 
msa25122 . 2{ 
msa25122 . 
msa25122 . 
msa25122 .2{ 
msa25122 
msa25122.2{l00 
msa25122. 
msa25122 . 



.2{100_090 
100_18RS21 
2{100_2603; 
2{100_A909 
100_CJB110 
2 { 100_COH1 
_JM9130013 
2f 100_M732 
2{100_M78l' 
Consensus 



201 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 

TNLPLPEAGG 
********** 



221 

YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 
YGEEVLENSA Y 

YGEEVLENSA Y 
********** * 



730 



WO 2004/018646 PCT/US2003/026827 



Table 41: Comparative Sequences relating to SAG0649 



SEQ ID NO. 4101: SAG0649 FROM 2603 V/R GBS TYPE V STRAIN 

ATGAAAAAGAGACAAAAAATA 

TGGAGAGGGTTATCAGTTACTITTACTAATCCTGT C CCAAATT C 

CAAGGTG AAAC C CAAGATAC CAATCAAG CACT TGG AAAAGTAATTGTTAAAAAAACGGGA 
GACAATG CT ACACCATTAGG CAAAG CG ACTTTTGTGTTAAAAAATGACAATG ATAAGT CA 
\ GAAACAAGT CACGAAACGGT AGAGG GTT CTGGAG AAG CAACCTTTGAA^CATAAAAC CT 
GGAGACTACACATTAAGAGAAGAAACAGGaCGAATTGGTTATAAAAAAACTGATAAAACC 
TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATA^TCGAGGGTATGGATGCAGATAAA 
G C^G AG AAACGAAAAGAAGTTTTG AATG C CCAAT ATC CAAAATCAGCTATTT ATGAG GAT 
ACAAAAGAAAATTAC CCATTAGTT AATGT AGAGGGTT CCAAAGTTGGTGAACAAT ACAAA 
GCATT GAAT C CAATAAATGG AAAAG ATGGTCG AAG AG AG ATTGCTGAAGGTTGGTTAT CA 
AAAAAAATTACAGGGGT CAATG AT CT CGATAAG AATA^TAT AAAATTG AATTAACTGTT 
GAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCA^CCACTAGATGTCGTTGTGCT 
TTAG ATAATT CAAAT AGTATGAAT AATGAAAG AG C CAAT AATTCT CAAAG AG CATTAAAA 
GCTGGGGAAG CAGTTGAAAAG CTG ATTGATAAAATTACAT CAAATAAAGACAATAGAGTA 
G CTCTTGTG ACAT ATG C CT CAACCATTTTTGATGGTACTGAAGCG AC CGT AT CAAAGGGA 
GTTG CCG AT CAAAATGGTAAAG CG CTG AATGATAGTGTAT CATGGGATTATCATAAAACT 
ACTTTTACAG CAACTACACATAATTACAGTTATTTAAATT TAACAAATGATG CTAACGAA 
GTTAATATTCTAAAGTCAAGAATT CCAAAGGAAG CGG AG CAT ATAAATGGGG AT CG CACG 
CTCTATCAATTTGGTGCGACATTTACTCAAAAAGCTCTAATGAAAGCAAATGAAAT^ 
GAGACACAAAGTTCTAATG CT AGAAAAAAACTTATT1TT CACGTAACTG ATGGTGTC CCT 
ACGATGT CTTATG CCATAAATTTTAAT CCTTATATAT CAACA.T CITAC CAAAACCAGTTT 
AATTCTTTTTTAAATAAAATAC CAG AT AGAAGTGGTATT CTC CAAGAGGATTTT AT AAT C 
AATGGTGATG AT TATCAAAT AGTAAAAGG AGATGG AGAG AGTTTT AAACTGTTTT CGGAT 
AGAAAAGTT C CTGTTACTGG AGGAACG ACACAAG CAGCTTAT CGAGTACCGCAAAAT CAA 
CTCTCTGTAATGAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTGG 
AGAGATTACAACTGG GT CTAT C CATTTGATCCTAAGACAAAG AAAGTTTCTG CAACGAAA 
CAAAT CAAAACTCATGGTG AG C CAACAACATT ATACITTAATGG AAATATAAGAC CTAAA 
GGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAGATCCTGGTGCAACTCCT 
GAAG CTG AGAAATTTATGCAAT CAATATCAAGTAAAACAG AAAATTATACTAATGTTGAT 
GATA CAAAT AAAATTTATGATG AG CTAAATAAATACTTTAAAACAATTGTTG AG G AAAAA 
CATTCTATTGTTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAATTCCAAT^ 
AAAAATGGT CAAAGTTTTACACATG ATGATTACGTTTTGGTTGGAAATGATC CAA 
TTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATGGGGGAATTTTAAAAGATGTT 
ACAGTGACTTATGATAAGACAT CT CAAACCAT CAAAATCAAT CATTTGAACTTAGGAAGT 
GGACAAAA^GTAGTTCTTACCTATGATGTACGTTTAAAAGATA^CTATATAAGTAACAAA 
TTTTACAATACAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATACT 
ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGTACTAACa\TC 
AGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAGTTAATAAAGACAAAG^TTCA 
GAATCG CTTTTGGG AG CTAAGTTT CAACT T CAGAT AG AAAAAGATTTTT CTGGGTAT AAG 
CAATTTGTTCCAGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATT^ 
G CACTT CAAG ATGGTAACTATAAATTATATGAAATTT CAAGT CCAGATGGCTAT ATAGAG 
GTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAA.TGGAGAAGTTACGAACCTGAAA 
GCAGATCCAAATGCTAATAAAAATCAAATCGGGTATCTTGAAGGAAATGGTAAACATCTT 
ATT AC CAACACT C CCAAACG CCCAC CAGGTGTTTTTC CTAAAACAGGGGG AATTGGTACA 
ATTGT CTAT ATATTAGTTG GTTCTACTTTTATGAT ACTTACCATTTGTT CTTTC CGT CGT 
AAACAATTG 



SEQ ID NO. 4102: SAG0649 FROM 090 GBS TYPE la STRAIN 

GGTGAAACCCAAGAT AC CAAT CAAG CACTTGG AAAAG 

TAATTGTTAAAAAAACGGG AGACAATG CTACAC CATTAGG CAAAG CG ACT 

TTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT CACGAAACGGT 

AGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACCTGGAGACTACA 

CATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACTGATAAAACC 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TG CAGAT AAAG CAG AGAAACGAAAAGAAGTTTTGAATGCC CAATATCCAA 

AATCAGCTTATTTATGAGGATACAAAAGAAAATTACCCATTAGTTAATGTA 

GAGGGTT CCAAAGTTGGTG AACAAT ACAAAG CATTGAATC CAATAAATGG 

AAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAATTA 

CAGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAATTAACTGTT 

GAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGT 

CGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAATA 

ATTCTTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAAAGCTGATTGAT 

AAAATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACATATGCCTC 

AACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATC 

AAAATGGTAAAG CG CTGAATGATAGTGTAT CATGGGATTATCATAAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTTAA^TTTAACAAATGA 

TG CTAACGAAGTTAATATT CTAAAGTCAAG AATT C CAAAGG AAG CGG AGC 

ATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACATTTACTCAA 

AAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAGTTCTAATGC 

TAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTT 

ATGCCATAAATTTTAATCCTTATATATCAACATCTTACCAAAACCAGTTT 

AATT CTTTT TTAAAT AAAAT AC CAG AT AG AAGTG GT ATTCT C CAAG AGG A 

TTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAGA 

GTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACA 

CAAGCAGCTTATCGAGTACCGCAAAATCAACrCrCTGTAATGAGTAATGA 

GGGATATGCAATTAATAGTGGATATATTTaTCTCTATTGGAGAGATTACA 

ACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTT CTG CAACGAAA 

CAAAT CAAAACT CATGGTG AG C CAACAACATTATACTTTAATGG AAATAT 

AAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAG 

ATCCTGGTGCAACTCCTCTTGAAGCTGAGAAATTTATGCAATCAATATCA 

AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 

TG AG CTAAATAAATACTTTAAAACAATTGTTG AGG AAAAACATT CTATTG 



i 
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TTGATGG AAATGTG ACTGAT CCTATGGGAG AGATGATTG AATT CCAATTA 
AAAAATGGTCAAAGTTTTACACATGATGATTACG t TTTGG tTGGAAATGA 
tGGC^GTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATG 
GGGG A^TTTTAAAAGATGTT ACAGTGACTTATGATAAGACATCT CAAAC C 
ATCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGTAGTT CTTAC 
CT ATG AT GT A CG TTT AAAAGAT AACT AT ATAAGT AACAAATTTT ACAAT A 
CAAATAATCGTACAACGCTAAGTC CGAAG AGTGAAAAAGAAC CAAAT ACT 
ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 
ACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAG 
TTAATAAAGACAAACATTCAGAATCGCTTTTGGGAGCTAAGTTTCAACTT 
CAGAT AGAAAAAGATTTTT CTG GGTATAAG CAATTTGTT C CAGAGGG AAG 
TGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAG 
ATGGT AACTATAAATT AT ATGAAATTT CAAGTC CAGATGG CTATATAGAG 
GTTAAAACGAA^CCTGTTGTGACATTTACAATTCAAAATGGAGAAGTTAC 
GAACCTGAAAGC7VGATCCAAATGCTAATAAAAATCy\AATCGGGTATCTTG 
AAGGAAATG GTAAACATCTTATTACCAACACTCCCAAACGCCCACCAGGT 
GTT 

SEQ ID NO. 4103: SAG0649 PROM A909 GBS TYPE la STRAIN 

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAA 
GTAATTGTTAAAAAAACGGGGGACAATGCTACACCATTAGGCAAAGCGAC 
TTTTGTGTTAAAAAATG ACAATGATAAGTCAgAAACAAGT CACG AAA.CGG 
TAGAGGGTT CTGGAG AAgCAAC CTTTG AAAACAT AAAAC CTG GAG ACTAC 
ACATTAAGAG AAGAAACAG CACCAATTGGTTATAAAAAAACTGAT AAAAC 
CTGG AAAGTTAAAGTTG CAGATAACGGAGCAACAATAATCGAGGGTATGG 
ATGCAGATAAAG CAG AGAAACX3 AAAAGAAGTTTTGAATG C CCAAT AT C CA 
AAAT CAGCTATTTATGAGGATACAAAAGAAAATTACC CATTAgTT AA.TGT 
AGAGGGTTCCAAAGTTGGTGAACAATACAAAGCAT^ 

GAAAAGATGGTCGAAGAGAGATTG CTG AAGGTTGGTTAT CAAAAAAAATT 
ACAGGGGTCAATGATCrrCGATAAGAATAAATATAAAATTGA^TTAACTGT 
TGAGGGTAAAAC CACTGTTG AAACG AAAGAACTTAAT CAACCACTAG ATG 
TCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAAT 
AATTCT CAAAGAG CATT AAAAG CTGGGGAAG CAGTTG AAAAG CTG ATTGA 
TAAAATTACATCAAATAAAG ACAATAGAGTAG CT CTTGTG ACATATG CCT 
CAACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGAT 
CAAAATX3GTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAAC 
TACTTTTACAGCAACTACACATAATTA 

ATGCTAACGAAGTTAATATT CTAAAGT CAAGAATTCCAAAGGAAGCGGAG 

CATAT AAATGGGGATCG CACG CTCTAT CAATTTGGTG CG ACATTTACTCA 

AAAAG CTCTAATGAAAG GAAATGAAATTTTAGAGACA.CAAAGTTCTAATG 

CTAGAAAAAAACTT ATTTTT CACGTAACTGATGGTGTCC CTACGATGTCT 

TATG CCATAAATTTT AATCCTTATATATCAACAT CTTACCAAAAC CAGTT 

TAATT CITTTTTAAAT AAAATAC CAGATAGAAGTGGTATT CT CCAAG AGG 

ATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAG 

AGTTTTAAACTGTTTTCGG ATAGAAAAGTT C CTGTTACTG GAGG AACGAC 

ACAAG CAGCTTATCGAGTAC CGCAAAATCAACTCT CTGT AATGAGTAATG 

AGGGATATGCAATTAATAGTG<3ATATATTTATCTCTATTGGAGAGATTAC 

AACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAAC 

ACAAATCAAAACTCATGGTGAGCCAACAACATTATACTTTAATGGAAATA 

TAAGACCTAAAGGTTATGAC^TTTTTACTGTTGGGATTGGTGTAAACGGA 

GATC CTGGTG CAACT C CT CTTG AAG CTGAG AAATTTATG CAAT CAATAT C 

AAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATG 

ATGAG CTAAATAAAT ACTTTAAAACAATTGTTGAGGAAAAACATTCTATT 

GTTG ATGGAAATGTG ACTG ATCCTATG GG AGAGATGATTG AATT CCAATT 

AAAAAATGGTCAAAGTTTTACACATGATGATTACGtTTTGGtTGGAAATG 

AtGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGAT 

GGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACATCTCAAAC 

CATCAAAAT CAATCATTTGAACTITAGGAAGTGGACAAAA^GTAGTTCTTA 

CCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAAT 

ACAAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATAC 

TATTCGTGATTT'CCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGG 

TACTAAC CATCAGT AAT CAGAAGAAAATGGGTGAGGTTG AATTTATTAAA 

GTTAATAAAGACAAACATTCAGAATCGCTTTTGGGAGCT 

TO\G ATAGAAAAAG ATTTTTCTGGGTATAAG CAAT TTGTT C CAGAGGGAA 

GTGATGTTAC^CAAAGAATGATGGTAAAATTTATTTTAAAGCACT^ 

GATGGTAACT ATAAATTAT ATGAAATTTCAAGT C CAGAT G G CTATAT AGA 

GGTTAAAACGAAAC CTG TTGTGACATTTACAATT CAAAATGG AG AAGTTA 

CGAAC CTGAAAG CAG ATCCAAATG CTAATAAAAAT CAAAT CGGGT AT CTT 

GAAGG AAATGGTAAACATCTTATT ACCAAGACT C C CAAACGC CCAC CAGG 

TGTT 

SEQ ID NO. 4104: SAG0649 FROM 18RS21 GBS TYPE II STRAIN 

GGTGAAACCCAAGATACCAATCAAGCAC 

TTGGAAAAGTAATTGTTAAAAAAACGGGAGACAaTGCTACACCaTTAGGC 
AAAG CGACTTTTCTGTTAAAAAATGACAATGATAAGT CAG AAACAAGT CA 
CGAAACGGTAGAGGGTTCTGGAGAAgCAACCTTTGAAAACATAAAACCTG 
GAGACTACACATTAAGAGAAGAAACAG CAC CAATTGGTTATAAAAAAACT 
GATAAAAC CTGG AAAGTTAAAGTTG CAG AT AACG GAG CAACAAT AAT CG A 
GGGTATGG ATG CAG ATAAAG CAGAG AAACG AA aAG AAGTTTTG AATG CC C 
AATAT CCAAAAT CAG CT ATTT ATG AGG ATACAAAAGAAAATT AC C CATTA 
GTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCC 
AATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAA 
AAAAAAT Ta C aG GGGT CAATGAT CTCG AT AAG AATAAAT ATAAAATTGAA 
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TTAACTGTTGAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACC 
ACTAGATGTCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAA 
GAGC CAATAATT CT CAAAGAG CATTAAAAG CTGGG GAAG CAGTTGAAAAG 
CTGATTG ATAAAATT ACAT CAAAT AAAGACAATAG AGTAG CT CTTGTGAC 
ATATGCCTCAACCATTTTTGATGGTACraAAGCGACCGTATCAAAGGGAG 
TTG C CGATCAAAATGGTAA^G CG CT GAATG AT AGTGT AT CATGGG ATTAT 
CATAAAACTACTTTTACAGCAACTACACATAAT^ 

AACAAATGATG CT AACGAAGTT AATATTCT AAAGTCAAG AATTC CAAAG G 

AAGCGGAGCATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACA 

TTTACTGAAAAAGCTCTAATGAAAGGAAATGAAATTTTAGAGACACAAAG 

TTCTAATGCrrAGAAAAAAA.CTTATTTTTCACGTAACTGATGGTGTCCCTA 

CG ATGTCTT ATG C CATAAATTTTAAT CCTT AT AT ATCAACAT CTT AC CAA 

AAC CAGTTTAATT CTTTTTT AAATAAAATAC CAG ATAGAAGTGGT ATT CT 

CCAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAG 

ATGGAGAGAGTTTTAAACTG1TTTCGGATAGAAAAGTTCCTGTTACTGGA 

GGAACGACACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCT 

GAGT AATGAGGG AT ATG CAATTAAT AGTGG AT ATATTTAT CTCTATTGGA 

GAGATTACAACTGGGT CTAT C CATTTG AT CCTAAGACAAAGAAAGTTT CT 

GCAACGAAACAAATCAAAACTCATGGTGAGCCAACAACATTAT^ 

TGGAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTG 

TAAACGGAGATCCTGGTGCAA.CT C CTCTTGAAG CTGAGAAATTTATG CAA 

TCAATATCAAGTAAAACAGAAAA.TTATACTAATGTTGATGATACAAATAA 

AATTTATG ATGAGCTAAAT AAATAOTTTAAAACAATTGT TGAGG AAAAAC 

ATTCTATTGTTGATG GAAATGTGACTG AT C CTATGGG AGAGATG ATTGAA 

TTCCAATTAAAAAATGGTCAAAGTTTTACAC^TGATGATTACGTTTT 

TGGAA ATGATGGCAGT CAATTA A AAAATGGTGTGG CT CTTGGTGG AC CAA 

ACAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACA 

TCTCAAACCl^TCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGT 

AGTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAAT 

TTTACAATACAAAT AAT CGT ACAACG CTAAGTC CG AAGAGTG AAAAAGAA 

CCAAATACTAT t cG t gATT t C C CAATT CC CAAAATT CGTG ATGTTCGTGA 

GTTTCCGGTACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGA^T 

TTATTAAAGTTAATAAAGACAAAGATTCAGAATCGCT 

TTT CAACTT CAG ATAGAAAAAG ATTTTTCTGGGTATAAG CAATTTGTTC C 

AGAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAG 

CACTTCAAG ATG GT A^CTAT AAATTATATGAAATTTCAAGTC CAG ATGG C 

TATATAGAGGTTAAAACGAAAC CTGTTGTG ACATTTACAATT CAAAATGG 

AGAAGTTACGAACCTG^AAAGCAGATCCAAATGCTAATAAAAATCAAATCG 

GGTATCTTGAAGGAAATGGTAAA.CATCTTATTACCAACACTCCCAAACGC 

CCACCAGGTGTT 

SEQ ID NO. 4105: SAG0649 FROM M732 GBS TYPE XXX STRAIN 

GGTG AAACCCAAGAT AC CAATCAAG CACT 

TGGAAAAGTAATTGTTAAAAAAACGGGAGACAaTGCTACACCATTAGGCA 
AAGCGACTTTTGTGTTAAAAAATGACAATGATAAGTC^GAAACAAGTCAC 
GAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACCTGG 
AGACTACACATTA^GAGAAGAAACAGCACCAATTGGTTATAAAAAA\CTG 
AT AAAACCTGGAAAGTTAAAGTTG CAG ATAACGG AG CAACAATAAT CGAG 
GGTATGGATG CAGATAAAG CAG AGAAACGAAAAG AAGTTTTGAATG C C CA 
ATATCCAAAATCAGCTATTTATGAGGATACAAAAGAAAATTACCCATTAg 
TT AAT GTAGAGGGTTCCAAAGTTG GTGAACAATACAAAG CATTG AAT CCA 
ATAAATGGAAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAA 
AAAAAaTa CaGGGGTCAATGATCT CGATAAGAATAAATATAAAATTGAAT 
TAACTGTTG AGGGT AAAACCACTGTTGAAACGAAAGAACTTAAT CAAC CA 
CT AG ATGTCGTTGTG CTATTAGATAATT CAAAT AGTATGi\AT AAT GAAAG 
AG C CAAT AATT CTCAAAGAG CATT AAAaG CTGGG G AAGCAGTTG AAAAG C 
TGATTGATAAAATTACATCAAATAAAGACAATAGAGTAGCTCTTGTGACA 
TATG C CT CAACCATTTTTGATGGTACTGAAG CGAC CGT AT CAAAGGG AGT 
TGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATC 
ATAAAACTACTTTTACAGCAACTACACA^ 

ACAAATG ATG CT AACGAAGTT AAT ATT CT AAAGT CAAGAATT CCAAAGG A 
AGCGGAGCATATAAATGGGGATCGCACGCTCTATCAATTTGGTGCGACAT 
TT ACT CAAAAAG CTCTAATG AAAG CAAATGAAATTTTAGAGACACAAAGT 
TCTAATGCTAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTAC 
GATGT CTTATG C CATAAATTTTAAT C CTTATATAT CAACATCTT ACCAAA 
AC CAGTTTAATT CTTTTTT AAATAAAATAC CAGAT AG AAGTGGT ATT CT C 
CAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGA 
TGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAG 
• GAACGACACAAGCAG CTTATCGAGTACCGCAAAAT CAACTCTCTGTAATG 
AGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAG 
AGATTACAACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTG 
CAACGAAACAAAT CAAAACT CATGGTG AG C CAACAA CATT ATACTTT AAT 
GG AAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGT 
AAACGGAGATCCTGGTGCAACTC CTCTTGAAG CTGAGAAATTTATG CAAT 
CAATATCAAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAA 
ATTTATGATGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAACA 
TTCTATTGTTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAAT 
TC CAATT AAAAAATGGTCAAAGCTTTACACATGATGATTACGtTTTGGtT 
GGAAATGAtGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAA 
CAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACAT 
CTCAAACCATCAAAATCAATC^CTTGAACTTAGGAAGTGGACAAAAA.GTA 
GTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATT 
TT ACAATACAAAT AAT CGT ACAa CG CT AAGT CCG AAG AGTGAAAAAG AAC 
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CAAATACTATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAG 
TTTCCGGTACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAAT^ 
TATTAAAGTTAATAAAG ACAAAOVTTCAGAAT CG CTTTTGGGAG CTAAGT 
TTCAACTTCAGATAGAAAAAGATTTTT 

GAGGGAAGTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGC 
ACTT CAAGATGGTAACTATAAATTATATG AAATTTCAAGTCCAGATGG CT 
ATATAGAGGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGA 
GAAGTTACG AAC CTGAAAG CAG AT CCAAATG CTAATAAAAAT CAAATCGG 
GTATCTTGAAGG AAATGGT AAACAT CTTATTACCAACACTCC CAAACG C C 
CACCAGGTGTT 

SEQ ID NO. 4106: SAG0649 FROM COH1 GBS TYPE XXI STRAIN 

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAAG 

TAATTGTTAAAAAAACGGGAGACAaTGCTACACCATTAGGCAAAGCGACT 

TTTGTGTTAAA^U\ATGACAATGAT AAGTCAGAAACAAGT CACGAAACGGT 

AGAGGGTTCTGGArAAGCAACCTTTGAAAACATAAAACCTGGAGACTACA 

CATTAAGAG AAG AAACAG CACCAATTGGTTATAAAAAAACTGATAAAAC C 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCCAATATCCAA 

AATCAG CTATTT ATG AGGATACAAAAGAAAATTAC CCATTAgTT AATGTA 

GAGGGTTCCAAAGTTGGTGAACAATaCAAAGCATTGAATCCAATAAATGG 

AAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAAATA 

CAGGGGTCAATGATCTCgATAAGAATAAATATAAAATTGAATTAACTGTT 

GAGGGTAAAACCACTGTTGAAACX5AAAGAACTTAATCAACCACTAGATGT 

CGTTGTGCTATTAG ATAATT CAAATAGTATGAATAATGAAAG AG C CAATA 

ATTCT CAAAG AG CATTAAAAG CTGGGGAAG CAGTTGAAAAGCTGATTGAT 

AAAATTACAT CAAAT AAAGA CAAT AGAGT AG CTCTTGTGACATATG C CT C 

AACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTG C CG AT C 

AAAATGGTAAAG CG CTG AATGATAGTGTAT CATGGGATTATCAT AAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTTAAATTTAACAAA^ 

TG CTAACGAAGTTAATATTCTAAAGTCAAGAATT C CAAAGGAAG CGGAGC 

ATATAAATG GGG AT CGCACG CT CTATCAATTTGGTG CGACATTTACTCAA 

AAAG CTCTAATG AAAG CAAATGAAATTTTAGAGACACAAAGTTCTAATG C 

TAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCT^ 

ATGCCATAAATTTTAATCCTTATATATCAACATCIT'ACCAAAACCA 

AATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCTCCAAGAGGA 

TTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAGATGGAGAGA 

Gl"riU'AAACTGTlTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACA 

CAAGCAGCTTATCGAGTACCGGAAAATCAACTCTCTGTAATGAGTAATGA 

GGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAGAGATTACA 

ACTGGGTCTATCCATTTGATCCTAAGACAAA.GAAAGTTTCTGCA 

CAAAT CAAAACT CAT GG TGAG CCAACAACATT ATA CTTTAATGGAAATAT 

AAGACCTAAAGGTTATGACA1TTTTACTGTTGGGATTGGTGTAAA.CGGAG 

AT CCTGGTGCA^CT C CTCTTG AAG CTGAGAAATTTATG CAAT CAATATCA 

AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 

TGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAACATTCTATTG 

TTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAATTCCAATTA 

AAAAATGGTCAAAGTTTTACACATGATGATTAOSTTTTGGTTGGAAATGA 

TGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATG 

GGGG AATTTTAAAAG ATGTTACAGTGACTT ATGAT AAGACAT CT CAAAC C 

ATCAAAATCAAT CATTTGAACTTAGGAAGTGGACAAAAAGTAGTTCTTAC 

CTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATA 

CAAAT AAT CGTACAACG CTAAGTCCGAAGAGTGAAAA^GAACCAAATACT 

ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 

ACTAACCATCAGTAATCAGAAGAAAATGGGTGAGGTTGAATTTATTAAAG 

TTAATAAAG ACAAACATTCAgAATCG CTTTTG GGAG CTAAGTTTCAACTT 

CAGATAGAAAAAGATTTTT CTGGGTAT AAG CAATTTGTT C CAGAGGGAAG 

TGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAG 

ATGGTAACTATAAATTATATGAAATTTCAAGTCCAgATGGCTATATAGAG 

GTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGGAGAAGTTAC 

GAACCTGAAAG CAG ATCCAAATGCT AAT AAAAAT CAAATCGG GT AT CTTG 

AAGGAAATGGTAAACATCTTATTACCAACACTCCCAAACGCCCACCAGGT 

GTT 

SEQ ID NO. 4107: SAG0649 PROM M781 GBS TYPE III STRAIN 

TTGGAAAAGTA^TTGTTAAAAAAACGGGAGACACTGCTACACCATTAGGC 
AAAG CGACTTTTGTGTT AAAAAAT G ACAATGATAAGT CAGAAACAAGT CA 
CGAAACGGTAGAC^GTTCTGGAAAAGCAACCTTTGAAAACATAAA^CCTG 
GAGACTACACATTAAGAGAAGAAACAGCACCAATTGGTTATAAAAAAACT 
GATAAAAC CTGGAAAGTTAAAGTTG CAGATAACGG AG CAmCAATAATCG A 
GGGT ATGGATGCAGATAAAG CAGAG AAACG A^AAGAAGTTTTG AATG CCC 
AATAT C CAAAAT CAG CT ATT T ATGAGGATACAAAAGAAAATTACC CATT A 
gTTAATGTAGAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCC 
AATAAATGG AAAAG ATGGT CgAAG AGAGATTG CTG AAGGTTG GTTAT CAA 
AAAAAATTACaGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAA 
TTAACTGTTGAGGGTAAAACCACTGTTGAAACgAAAGAACTTAATCAACC 
ACrAGATGTCGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAA 
GAGC CAAT AATTCT CAAAGAG CATT AAAAG CTGGG G AAG CAGTTGAAAAG 
CT GATTGAT AAAATTACATC AAATAAAGACAATAG AGTAG CT CTTGTG AC 
ATATGCCTCAACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAG 
TTGCCGATCAAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTAT 
CATAAAACT ACTTTT ACAG CAACTACACATAATTACAGTTATTT AAATT T 
AACAAATGATG CTAACG AAGTTAATATTCT AAAGT CAAG AATTC CAAAGG 
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AAGCGGAGCATATAAATGGGGATCGCACX3CTCTATCAATTTGGTGCGACA 
TTTACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAG 
TTCTAATGCTAGAAAAAAACTTATTTTTCACX5TAACTGATGGTGTCCCTA 
CGATGTCTTATGCCATAAATTTTAATCCTTATA^^ 

AACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCT 
CCAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGGAG 
ATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGA 
GGAACGACACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTAAT 
GAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTAtTGGA 
GAGATTACAACTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCT 
GCAACGAAACAAATCAAAACTCATGGTGAGCCAACAACATTATACTTTAA 
TGGAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTG 
TAAACGG AGATCCTG GTGCAACTC CTCTTG AAG CTGAGAAATTTATG CAA 
TCAATATCAAGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAA 
AATTTATGATGAGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAAAC 
ATTCTATTGTTGATGGAAATGTGACTGATCCTATGGGAGAGATGATTGAA 
TTCCAATTAAAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGT 
TGGAAATGATGGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAA 
ACAGTGATGGGGGAATTTTAAAAGATGTTACAGTGACTTATGATAAGACA 
TCTCAAACCATCAAAATCAATCATTTGAACTTAGGAAGTGGACAAAAAGT 
AGTTCTTACCTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAAT 
TTT ACAATACAAATAAT CGTACAACG CTAAGT CCGAAGAGTGAAAAAGAA 
CCAAATACTATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGA 
GTTTCCGGTACTAAC CAT CAGTAAT CAGAAGAAAATGGGT GAGGTTGAAT 
TTATTAAAGTTAATAAAGACAAACATT CAGAATCG CTTTTGGGAG CT AAG 
TTTCAACTTCAGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCC 
AGAGG GAAGTGATGTTACAACAAAG AATG ATGGTAAAATT TATTTTAAAG 
CACTTCAAGATGGTAACTATAAATTATATGAAATTT CAAGTCCAG ATGG C 
TATATAGAGGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAATGG 
AG AAGTT ACGAACCTGAAAG CAGAT C CAAATG CTAAT AAAAAT CAAATCG 
GGTATCTTGAAGGAAATGGTAAACATCTTATTACCAACACTCCCAAACGC 
CCACCAGGTGTT 

SEQ ID NO. 4108: SAGO 64 9 FROM CJB GBS NONTYPEABLE STRAIN 

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAAGT 
AATTGTTAAAAAAACGGGAGACAaTGCTACACCATTAGG CAAAG CGACTT 
TTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGTCACGAAACGGTA * 
GAGGGTT CTGGAr AAGCAAC CTTTG AAAACATAAAAC CTGGAGACTACAC 
- ATTAAGAGAAGAAACAG CAC CAATTGGTT ATAAAAAAACTGATAAAACCT 
GGAAAGTTAAAGTTG CAGAT AACGG AG CAACAAT AATCG AGGGTATGGAT 
GCAG ATAAAG GAGAGAAACGAAAAGAAGTTTTGAATG CCCAATATCCAAA 
AT CAG (CTATTTATG AGG AT ACAAAAGAAAATTAC CC^TT AgTTAATGTAG 
AGGGTT C CAAAGTTGGTGAACAAT ACAAAG CATTGAATCCAATAAATGGA 
AAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAATTAC 
aGGGGTCAATGATCTCGATAAGAATAAATATAAAATTGAATTAACTGTTG 
AGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGTC 
GTTGTGCTATTAgATAATTCAAATAGTATGAATAATG AAAGAG C CAATAA 
TrCTCAAAGAGCATTAAAAGOTGGGGAAGCAGTTGAAAAGCTGATTGATA 
AAATTACAT CAAAT AAAGACAATAGAGTAGCT CTTGTGACATATG C CT CA 
ACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATCA 
AAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAACTA 
CTTTTACAG CAACT ACACATAATT ACAGTTATTTAAATTTAAGAAATGAT 
GCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGGAAGCGGAGCA 
TATAAATGGGGATCG CACG CTCTAT CAATTTGGTG CGACATTTACT CAAA 
AAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAGTTCTAATGCT 
AGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTTA 
TGC CATAAATTTTAATC CTTATATAT CAACATCTTACCAAAACCAGTTTA 
ATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATTCTCCAAGAGGAT 
TTTAT AATCAATGGTGATG ATTAT CAAATAGTAAA^G GAG ATGGAGAGAG 
TTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTGGAGGAACGACAC 
AAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTAATGAGTAATGAG 
GG AT ATG CAATTAAT AGTG G AT ATATTTAT CT CTATTGGAGAGATTACAA 
CTGGGTCTATCCATTTGATCCTAAGACAAAGAAAGTTTCTGCAACGAAAC 
AAATCAAAACTCATGGTGAGCCAACAACATTATACTTTAATGGAAATATA 
AGACCTAAAG GTTATGACATTTTTACTGTTGGGATTGGTGTAAAC GG AGA 
TCCTGGTGCAACTCCTCTTGAAGCTGAGAAATTTATGCAATCAATATCAA 
GTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGAT 
GAG CTAAATAAATACTTT AAAACAATTGTTGAGG AAAAACATTCTAT TGT 
TG ATGGAAATGTGACTG AT CCT ATGGG AGAGATGATT GAATTCCAAT TAA 
AAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGAt 
GGCAGTCAATTAAAAAATGGTGTGGCTCTTGGTGGACCAAACAGTGATGG 
GGGAATTTT AAAAG ATGTTACAGT GACTTATG AT AAG ACATCT CAAAC CA 
TCAAAAT CAATCATTTG AACTTAG GAAGTGG ACAAAAAGT AGTT CTTACC 
TATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATAC 
AAATAATCGTACAACGCTAAGTCCGAAGAGTGAAAAAGAACCAAATACTA 
TTCGTGATTTCCCAAT t CCCAAAATTCGTGATGTTCGTGAGTTTC CGGTA 
CTAACCAT CAGTAAT CAGAAGAAAATGGGTGAGGTTG AATTTATT AAAGT 
TAATAAAGACAAACATT CAGAATCG CTTTTGGGAG CTAAGTTTCAACTTC 
AGATAGAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAAGT 
GATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGCACTTCAAGA 
TGGTAACTATAAATTATATGAAATTTCAAGTCCAGATGGCTATATAGAGG 
TTAAAACGAAACCTGTTGTGACATTTACAATTCAaAATGGAGAAGTTACG 
AACCTGAAAG CAGAT C CAAATG CTAATAAAAAT CAAATCG GGTAT CT TGA 
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AGGAAATGGTAAACATCTITATTACCAACACTCCCAAACGCCCACCAGGTG 
TT 

SEQ ID NO. 4109: SAG0649 FROM JM9130013 GBS TYPE VIXI STRAIN 

GGTGAAACCCAAGATACCAATCAAGCACTTGGAAAAG 

TAATTGTTAAAAAAACGGGAGACAATGCTACACCATTAGGCAAAGCGACT 

TTTGTGTTAAAAAATGACAATGATAAGTCAGAAAGAAGTCACGAAACGGT 

AGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACCTGGAGACTACA 

CATT AAG AG AAGAAACAGCAC CAATTG GTTAT AAAAAAACTGAT AAAAC C 

TGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATCGAGGGTATGGA 

TGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGCCCAATATCCAA 

AATCAGCTATTTATGAGGATACAAAAGAAAATTACCCATTAGTTAATGTA 

GAGGGTTCCAAAGTTGGTGAACAATACAAAGCATTGAATCCAATAAATGG 

AAAAGATGGTCGAAGAGAGATTGCTGAAGGTTGGTTATCAAAAAAAATTA 

CAGGGGT CAATG AT CTCGATAAGAATAAATATAAAATTG AATTAACTGTT 

GAGGGTAAAACCACTGTTGAAACGAAAGAACTTAATCAACCACTAGATGT 

CGTTGTGCTATTAGATAATTCAAATAGTATGAATAATGAAAGAGCCAATA 

ATTCTCAAAGAGCATTAAAAGCTGGGGAAGCAGTTGAAAAGCTGATTGAT 

AAAATTACAT CAAATAAAG ACAATAGAGTAG CT CTTGTG ACATATG C CT C 

AACCATTTTTGATGGTACTGAAGCGACCGTATCAAAGGGAGTTGCCGATC 

AAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTATCATAAAACT 

ACTTTTACAGCAACTACACATAATTACAGTTATTTAAATTTAACAAATGA 

TGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAGGAAGCGGAGC 

AT AT AAATGG GG AT CG CACG CTCT ATCAATTTGGTG CGACATTTACTCAA 

AAAGCTCTAATGAAAGCAAATGAAATTTTAGAGACACAAAGTTCTAATGC 

TAGAAAAAAACTTATTTTTCACGTAACTGATGGTGTCCCTACGATGTCTT 

ATGC CATAAATTTTAATCCTTATAT AT CAACATCTTACCAAAACCAGTTT 

AATT CTTTTTTAAAT AAAATAC CAG ATAG AAGTGGTATT CTC CAAGAGGA 

TTTTATAATCAATGGTGATGATTATCAAATAGTAAAA.GGAGATGGAGAGA 

GTTTTAAACTGTTTT CGGATAGAAAAGTT C CTGTT ACTGGAGGAACGACA 

CAAG CAG CTT AT CG AGT AC CG CAAAATCAACT CTCTGTAATGAGTAATGA 

GGGATATGCAATTAATAGTGGATATATTTATCTCTATTGGAGAGATTACA 

ACTGGGTCTATC^TTTGATCCTAAGACAAAGAAAGTTTCTGCAACGAAA 

CAAAT CAAAACT CATGGTG AGCCAACAAGATTATACTTTAATGG AAATAT 

AA.GACCTAAAGGTTATGACATTTTTACTGTTGGGATTGGTGTAAACGGAG 

ATCCTGGTG CAACT C CTCTTGAAG CTG AGAAATTTATGCAATCAATATCA 

AGTAAAACAGAAAATTATACTAATGTTGATGATACAAATAAAATTTATGA 

TGAGCTAAATAA^T ACT TTAAAACAATTGTTGAGGAAAAA CATT CTATTG 

TTGATGGAAATGTGACTGATCOTATGGGAGAGATGATTGAATTCCAATTA 

AAAAATGGTCAAAGTTTTACACATGATGATTACGTTTTGGTTGGAAATGA 

TGGCAGTCAATTAAAAAATGGTGTGGCTCnrrGGTGGACCAAACAGTGATG 

GGGGAATTTTAAAAGATGTTAC^GTGACTTATGATAAGACATCTCAAACC 

ATCAAAAT CAAT (^TTTGAACIT AGGAAGTGGACAAAAAGTAGTT CTTAC 

CTATGATGTACGTTTAAAAGATAACTATATAAGTAACAAATTTTACAATA 

CAAAT AATCGTACAACG CT AAGT C CGAAG AGTGAAAAAG AACCAAATACT 

ATTCGTGATTTCCCAATTCCCAAAATTCGTGATGTTCGTGAGTTTCCGGT 

ACTAACCAT CAGT AATCAAAAG AAAATGGGTG AGGTTGAATTTATTAAAG 

TTAAT AAAGACAAACATTCAGAAT CG CTTTTGGGAG CTAAGTTTCAACTT 

CAGATAAAAAAAGATTTTTCTGGGTATAAGCAATTTGTTCCAGAGGGAAG 

TGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAAAGC^ 

ATGGTAACTATAAATTATATGAAATTTCAAGTCCAGATGGCTATATAGAG 

GTTAAAACG AAAC CTGTTGTGACATTTACAATTCAAAATGGAGAAGTTAC 

GAAC CTGAAAGCAG ATCCAAATG CTAATAAAAAT CAAAT CGG GT AT CTTG 

AA 



736 
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Table 41: Comparative Sequences relating to SAG0649 

PRETTY of: /biotmp/msal78297 . 2 { * } May 12, 2003 09:22 

1 50 

msal78297 .2{l04__090} — ~ — 

msal78297 . 2 { 104_18RS2l} 

msal78297 . 2 { 104_2603 } atgaaaaaga gacaaaaaat atggagaggg ttatcagtta ctttactaat 

msal78297 . 2 { 104_CJB110 } ~ 

msa!78297 . 2 {l04_COHl} 

msal78297 . 2{l04_M732} 

msal78297 .2{104_A909} 

msal78297.2{l04_M78l} 

msal78297 . 2{l04_JM913 0 013} 

Consensus ********** ********** ********** ********** ********** 

51 100 

msal78297 . 2 { 104_090} ggtgaa acccaagata 

msal78297 .2 {l04_18RS2l} ggtgaa acccaagata 

msal78297 . 2{l04_2603} cctgtcccaa attccatttg gtatattggt acaaggtgaa acccaagata 

msa!78297 . 2 { 104_CJB110 } ggtgaa acccaagata 

msa!78297 . 2 { 104_COH1 } « ggtgaa acccaagata 

msal78297 . 2 { 104__M732 } ggtgaa acccaagata 

msal78297.2{l04_A909} ggtgaa acccaagata 

msal78297 .2{l04_M78l} 

msal782 97 . 2 { 104_JM9130013 } — ggtgaa acccaagata 

Consensus ********** ********** ********** **** __, 



msal78297 
msal78297.2{ 
rasal78297 . 
msal78297.2{ 
msal78297. 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{!04 



.2{104_090 
104_18RS21 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732 
2{104_A909 
2{l04_M781 
_JM9130013 
Consensus 



101 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 

ccaatcaagc 



acTTGGAAAA 
acTTGGAAAA 
acTTGGAAAA 
acTTGGAAAA 
acTTGGAAAA 
acTTGGAAAA 
acTTGGAAAA 
— TTGGAAAA 
acTTGGAAAA 



GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 
GTAATTGTTA 



AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 
AAAAAACGGG 



150 

aGACAaTGCT 
aGACAaTGCT 
aGACAaTGCT 
aGACAaTGCT 
aGACAaTGCT 
aGACAaTGCT 
gGACAaTGCT 
aGACAcTGCT 
aGACAaTGCT 



******** ********** ********** _****„**** 



msa!78297 
msal78297 .2 { 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2l) 
2{l04_2603 ! 
104_CJB110~ 
2{104_COH1 
2{104_M732 
2{104_A909 
2{l04_M781 
__»JM9130013/ 
Consensus 



151 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 

ACACCATTAG 
********** 



GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
GCAAAGCGAC 
********** 



TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
TTTTGTGTTA 
********** 



AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 

AAAAATGACA 
********** 



200 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 

ATGATAAGTC 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msa!78297. 
msal78297 . 
msal78297 . 
msal78297 .2(104 



-2{104_090 
104_18RS21 
2{104_2603 
104_CJB110 
2{104_COH1; 
2{104_M732 
2{104_A909 
2{104_M781 
_JM9130013 
Consensus 



201 

AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
AGAAACAAGT 
********** 



CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 

CACGAAACGG 
********** 



TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 
TAGAGGGTTC 



TGGAgAAGCA 
TGGAgAAGCA 
TGGAgAAGCA 
TGGArAAGCA 
TGGArAAGCA 
TGGAgAAGCA 
TGGAgAAGCA 
TGGAaAAGCA 
TGGAgAAGCA 



250 

ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 
ACCTTTGAAA 



********** ****_***** ********** 



msal78297 
msal78297 .2 { 
msal78297. 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msa!782 97.2{l04 



.2{104_090} 
104_18RS2l} 
2{104_2603) 
104_CJB110) 
2{104_COH1} 
2{104_M732} 
2{104_A909) 
2{104_M781) 
_JM9130013} 
Consensus 



251 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 

ACATAAAACC 
********** 



TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 

TGGAGACTAC 
********** 



ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 

ACATTAAGAG 
********** 



AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 
AAGAAACAGC 

AAGAAACAGC 
********** 



300 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 

ACCAATTGGT 
********** 



msal78297 
msal78297 ,2{ 

msa!78297 . 
msal78297 .2{ 

msal78297 . 

msal78297 . 

msal78297 . 

msal78297 . 



•2{104_090) 

104_18RS21* 

2{104_2603 

104_CJB110 

2{104_COH1, 

2{104_M732} 

2f 104_A909) 

2{l04_M781) 



301 

TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 
TATAAAAAAA 



CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 
CTGATAAAAC 



CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 
CTGGAAAGTT 



AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 
AAAGTTGCAG 



350 

ATAACGGAGC 
ATAACGGAGC' 
ATAACGGAGC 
ATAACGGAGC 
ATAACGGAGC 
ATAACGGAGC 
ATAACGGAGC 
ATAACGGAGC 
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Table 41: Comparative Sequences relating to SAG0649 



msal78297.2{l04_JM9130013} 

Consensus 



TATAAAAAAA CTGATAAAAC CTGGAAAGTT AAAGTTGCAG ATAACGGAGC 
********** ********** ********** ********** ********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 ,2{ 
msa!78297 . 
rasal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090 
104_18RS21 
2{104_2603 
104_CJB110 
2(l04_COHl 
2{104_M732 
2{l04_A909] 
2{l04_M781 
_JM9130013 
Consensus 



351 

AaCAATAATC 
AaCAATAATC 
AaCAATAATC 
AaCAATAATC 
AaCAATAATC 
AaCAATAATC 
AaCAATAATC 
AmCAATAATC 
AaCAATAATC 



GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 
GAGGGTATGG 



ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 
ATGCAGATAA 



*_******** ********** ********** 



AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 

AGCAGAGAAA 
********** 



400 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 

CGAAAAGAAG 
********** 



msal78297 
msal78297.2{ 
msal78297 - 
msal78297 ,2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090 
104_18RS21 
2{104_2603 
104_CJB110 
2{l04_COHl 
2{l04_M732 
2fl04_A909 
2{l04_M781 
_JM9130013] 
Consensus 



401 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 

TTTTGAATGC 
********** 



CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
CCAATATCCA 
********** 



AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 

AAATCAGCTA 
********** 



TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 

TTTATGAGGA 
********** 



450 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 

TACAAAAGAA 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{!04 



.2{104_090} 
104_18RS21 
2{104_2603 
104_CJB110 
2{104 > _COH1 
2{104_M732] 
2{104_A909} 
2{l04_M78l} 
JM9130013} 
Consensus 



451 

AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
AATTACCCAT 
********** 



TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 

TAGTTAATGT 
********** 



AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 

AGAGGGTTCC 
********** 



AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 

AAAGTTGGTG 
********** 



500 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 

AACAATACAA 
********** 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{ 104^090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2ll04_M732} 
2\ 104__A909} 
2{104_M781} 
_JM9130013} 
Consensus 



501 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 

AGCATTGAAT 
********** 



CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 

CCAATAAATG 
********** 



GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 

GAAAAGATGG 
********** 



TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 

TCGAAGAGAG 
********** 



550 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 

ATTGCTGAAG 
********** 



msal7B297 
msal78297 .2{ 
msal78297 . 
msal78297 . 2 { 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 .2{l04 



,2{l04_090 
104_18RS21 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732 
2{104_A909 
2{104_M781 
_JM9130013; 
Consensus 



551 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 

GTTGGTTATC 
********** 



AAAAAAAAtT 
AAAAAAAAtT 
AAAAAAAAtT 
AAAAAAAAtT 
AAAAAAAAaT 
AAAAAAAAaT 
AAAAAAAAtT 
AAAAAAAAtT 
AAAAAAAAtT 
********„* 



ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 

ACAGGGGTCA 
********** 



ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 

ATGATCTCGA 
********** 



600 

TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 
TAAGAATAAA 

TAAGAATAAA 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
rasal78297 . 
msal78297 . 
msal78297.2{l04 



.2{l04_090} 
104_18RS2i; 
2{104_2603 
104_CJB110; 
2{104_COH1} 
2{104_M732^ 
2{104_A909 
2{104_M781 
JM9130013; 
Consensus 



601 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 

TATAAAATTG 
********** 



AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 

AATTAACTGT 
********** 



TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 

TGAGGGTAAA 
********** 



ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 

ACCACTGTTG 
********** 



650 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 

AAACGAAAGA 
********** 



msa!78297.2{l04_090} 
msal78297 .2 {l04_18RS2l} 
msa!78297 .2{104_2603) 
msal78297 .2 {104_CJB110} 
msal78297 .2{104_COH1} 
msal78297 .2f 104_M732) 
msal78297 .2{104_A909) 



651 

ACTTAATCAA 
ACTTAATCAA 
ACTTAATCAA 
ACTTAATCAA 
ACTTAATCAA 
ACTTAATCAA 
ACTTAATCAA 



CCACTAGATG 
CCACTAGATG 
CCACTAGATG 
CCACTAGATG 
CCACTAGATG 
CCACTAGATG 
CCACTAGATG 



TCGTTGTGCT 
TCGTTGTGCT 
TCGTTGTGCT 
TCGTTGTGCT 
TCGTTGTGCT 
TCGTTGTGCT 
TCGTTGTGCT 



ATTAGATAAT 
ATTAGATAAT 
ATTAGATAAT 
ATTAGATAAT 
ATTAGATAAT 
ATTAGATAAT 
ATTAGATAAT 



700 

TCAAATAGTA 
TCAAATAGTA 
TCAAATAGTA 
TCAAATAGTA 
TCAAATAGTA 
TCAAATAGTA 
TCAAATAGTA 
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msal7&297 , 2 { 104_M781> 
msal78297 . 2 { 10.4_JM9130013 } 

Consensus 



ACTTAATCAA C CACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA 

ACTTAATCAA C CACTAGATG TCGTTGTGCT ATTAGATAAT TCAAATAGTA 
********** ********** ********** ********** ********** 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{l04_090} 
104_18RS2l} 
2 { 104^2603 } 
104_CJB110 } 
2{104_COH1} 
2{104_M732} 
2{104_A909} 
2{104J4781} 
_JM9130013} 
Consensus 



701 

TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 
TGAATAATGA 



AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 
AAGAGCCAAT 



AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 
AATTCTCAAA 



GAGCATTAAA 
GAG CATTAAA 
GAGCATTAAA 
GAGCATTAAA 
GAGCATTAAA 
GAGCATTAAA 
GAGCATTAAA 
GAGCATTAAA 
GAGCATTAAA 



750 

AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 
AGCTGGGGAA 



********** ********** ********** ********** ********** 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2l} 
2{l04_2603} 
104_CJB110 
2{l04_COHl 
2{104_M732 
2{104_A909 
2{104_M781 ■ 
_JM9130013 
Consensus 



751 

GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
GCAGTTGAAA 
********** 



AG CTGATTGA 
AGCTGATTGA 
AG CTGATTGA 
AGCTGATTGA 
AGCTGATTGA 
AGCTGATTGA 
AGCTGATTGA 
AGCTGATTGA 
AGCTGATTGA 
********** 



TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 

TAAAATTACA 
********** 



TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 

TCAAATAAAG 
********** 



800 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 

ACAATAGAGT 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 ,2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msa!78297.2{l04 



.2{l04_090 
104_18RS21 
2{l04_2603 
104_CJB110 
2{l04_COHl; 
2{104_M732 ! 
2{104_A909; 
2{l04_M78l' 
_JM9130013/ 
Consensus 



801 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 

AGCTCTTGTG 
********** 



ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 
ACATATGCCT 



CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 
CAACCATTTT 



********** ********** 



TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 

TGATGGTACT 
********** 



850 

GAAGCGACCG 

GAAGCGAC CG 

GAAGCGACCG 

GAAGCGACCG 

GAAGCGACCG 

GAAGCGACCG 

GAAGCGACCG 

GAAGCGACCG 

GAAGCGACCG 
********** 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297.2{!04 



.2{l04_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2{104_M732> 
2{104_A909) 
2{104_M781} 
_JM9130013} 
Consensus 



851 

TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 
TATCAAAGGG 



AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 
AGTTGCCGAT 



********** ********** 



CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 

CAAAATGGTA 
********** 



AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 

AAGCGCTGAA 
********** 



900 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 

TGATAGTGTA 
********** 



msal78297 
msal78297 . 2{ 
rasal78297 - 
msal78297.2{ 
msal78297 . 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2lJ 
2{l04_2603} 
104_CJB110" 
2{104_COH1* 
2{l04_M732 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



901 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 

TCATGGGATT 
********** 



ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 

ATCATAAAAC 
********** 



TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 

TACTTTTACA 
********** 



GCAACTACAC 
GCAACTACAC 
GCAACTACAC 
GCAACTACAC 
GCAACTACAC 
GCAACTACAC 
GCAACTACAC 
GCAACTACAC 

GCAACTACAC 
********** 



950 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 

ATAATTACAG 
********** 



msal78297 
msal78297 -2{ 
rasa!78297 . 
msal78297 .2{ 
msal78297 . 
msal78297. 
msal78297 . 
msal78297. 
msal78297.2{!04 



.2{l04_090} 
104_18RS21} 
2{104__2603} 
104_CJB110} 
2{104__COH1} 
2(l04_M732} 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



951 

TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 
TTATTTAAAT 



TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 
TTAACAAATG 



ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 
ATGCTAACGA 



AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 
AGTTAATATT 



1000 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 
CTAAAGTCAA 



********** ********** ********** ********** ********** 



msal78297 .2{104_090> 
msal78297 . 2 { 104_18RS21 } 
msal78297 .2{l04_2603* 
msal78297 . 2 { 104__CJB110 
msal78297.2(l04_COHl} 
msal78297 . 2 { 104_M732 } 



1001 

GAATTCCAAA 
GAATTCCAAA 
GAATTCCAAA 
GAATTCCAAA 
GAATTCCAAA 
GAATTCCAAA 



GGAAGCGGAG 
GGAAGCGGAG 
GGAAGCGGAG 
GGAAGCGGAG 
GGAAGCGGAG 
GGAAGCGGAG 



CATATAAATG 
CATATAAATG 
CATATAAATG 
CATATAAATG 
CATATAAATG 
CATATAAATG 



GGGATCGCAC 
GGGATCGCAC 
GGGATCGCAC 
GGGATCGCAC 
GGGATCGCAC 
GGGATCGCAC 



1050 
GCTCTATCAA 
GCTCTATCAA 
GCTCTATCAA 
GCTCTATCAA 
GCTCTATCAA 
GCTCTATCAA 



739 



WO 2004/018646 



Table 41: Comparative Sequences relating to SAG0649 



msal78297 . 2 f 104_A909 } 
msa!78297 . 2 { 104__M781 } 
msal78297 . 2 { 104_JM9130013 } 

Consensus 



GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA 

GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA 

GAATTCCAAA GGAAGCGGAG CATATAAATG GGGATCGCAC GCTCTATCAA 
********** ********** ********** ********** ********** 



msal78297 
msal78297 .2 { 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msa!78297 .2{l04 



.2{l04_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{l04_COHl} 
2{104_M732} 
2{l04_A909j 
2{104_M781} 
_JM9130013} 
Consensus 



1051 

TTTGGTGCGA 

TTTGGTG CGA 

TTTGGTGCGA 

TTTGGTGCGA 

TTTGGTGCGA 

TTTGGTGCGA 

TTTGGTGCGA 

TTTGGTGCGA 

TTTGGTGCGA 
********** 



CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 

CATTTACTCA 
********** 



AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 

AAAAGCTCTA 
********** 



ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 

ATGAAAGCAA 
********** 



1100 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 

ATGAAATTTT 
********** 



msa!78297 
msal78297 .2{ 
msal78297 . 
msal78297.2{ 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{l04_090] 
104_18RS21 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732 
2{104_A909 
2{104_M781} 
_JM9130013} 
Consensus 



1101 

AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
AGAGACACAA 
********** 



AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 

AGTTCTAATG 
********** 



CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
CTAGAAAAAA 
********** 



ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 

ACTTATTTTT 
********** 



1150 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
CACGTAACTG 
********** 



msal78297 
msal78297 .2{ 
msal78297 . 
msa!78297 .2 { 
msal78297 . 
msal78297 . 
msal78297 . 
msa!78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2l) 
2{104_2603} 
104_CJB110} 
2{l04_COHl) 
2{104_M732} 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



1151 

ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 
ATGGTGTCCC 



TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 
TACGATGTCT 



TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 
TATGCCATAA 



********** ********** ********** 



ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 

ATTTTAATCC 
********** 



1200 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 

TTATATATCA 
********** 



msal78297 
msal78297 .2 { 
msal78297 . 
. msa!78297 .2{ 
tnsal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090 
104_18RS21 
2{104_2603/ 
104_CJB110] 
2{104_COH1 
2{104_M732 
2{l04_A909 
2{104_M781 
_JM9130013; 
Consensus 



1201 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 

ACATCTTACC 
********** 



AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 

AAAACCAGTT 
********** 



TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 

TAATTCTTTT 
********** 



TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 

TTAAATAAAA 
********** 



1250 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 

TACCAGATAG 
********** 



msa!78297 
msal78297.2{ 
msal782 97 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msa!78297.2{l04 



.2{l04_090j 
104_18RS2l} 
2{104_2603} 
104_CJB110" 
2{104_COH1 
2{104_M732 
2{104_A909 
2{104_M781) 
_JM9130013} 
Consensus 



1251 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 

AAGTGGTATT 
********** 



CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 

CTCCAAGAGG 
********** 



ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 

ATTTTATAAT 
********** 



CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 
CAATGGTGAT 



1300 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 
GATTATCAAA 



********** ********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090 
104__18RS21 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732 
2{104_A909 
2{104__M781; 
_JM9130013) 
Consensus 



1301 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 

TAGTAAAAGG 
********** 



AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 

AGATGGAGAG 
********** 



AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 

AGTTTTAAAC 
********** 



TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
TGTTTTCGGA 
********** 



1350 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 

TAGAAAAGTT 
********** 



msal78297. 2 {104_090} 
msal78297 . 2 { 104_18RS21 } 

msal78297.2{l04_2603} 
msal78297 . 2 { 104_CJB110 J 

msa!782 97 . 2 { 104_COH1 } 



1351 

CCTGTTACTG 
CCTGTTACTG 
CCTGTTACTG 
CCTGTTACTG 
CCTGTTACTG 



GAGGAACGAC 
GAGGAACGAC 
GAGGAACGAC 
GAGGAACGAC 
GAGGAACGAC 



ACAAGCAGCT 
ACAAGCAGCT 
ACAAGCAGCT 
ACAAGCAGCT 
ACAAGCAGCT 



TATCGAGTAC 
TAT CGAGTAC 
TATCGAGTAC 
TATCGAGTAC 
TATCGAGTAC 



1400 
CGCAAAATCA 
CGCAAAATCA 
CGCAAAATCA 
CGCAAAATCA 
CGCAAAATCA 



740 



WO 2004/018646 



PCT/US2003/026827 



Table 41: Comparative Sequences relating to SAG0649 



msal78297 . 2 { 104_M732 
msal78297 . 2 { 104_A909 
msal78297.2(l04_M781 
msal78297 .2{104_JM9130013 

Consensus 



msal78297 
msal78297.2{ 
rnsal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 .2(104 



.2{104_090} 
104_18RS2l) 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2{104_M732 \ 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msa!78297 . 
msal78297 . 
msal78297 . 
msa!782 97 .2(104 



.2{l04_090} 
104_18RS21} 
2{104_2603} 
104_CJB110} 
2{104_COH1) 
2{104_M73 2) 
2{l04_A90 9} 
2{l04_M7Sl} 
_JM9130013} 
Consensus 



msa!78297 
msal78297.2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msa!78297 . 
msal78297 . 
msal78297 .2(104 



.2{104_090} 
104_18RS2l} 
2{104_2603} 
104 CJB110} 
2(104_COH1} 
2(104_M732} 
2{104_A909} 
2{l04_M78lJ 
,_JM9130013} 
Consensus 



msal78297 
msal78297 .2( 
msal78297 . 
msal78297.2( 
msal78297 . 
msal78297. 
msal78297 . 
msal78297 
msal78297. 2(104 



.2(104_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110* 
2(l04_COHi; 
2(104_M732; 
2(104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297. 2(104 



.2{l04_090} 
104_18RS21} 
2(104_2603' 
104_CJB110 
2(l04_COHl' 
2{104_M732} 
2(104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



msal78297 
msal78297.2( 
msal78297 . 
msal78297 .2{ 
msa!78297 . 
msal78297. 
msa!78297 . 
msal78297 . 
msal78297 .2(104 



.2(104_090} 
104_18RS2l} 
2(104_2603} 
104_CJB110} 
2(104_COH1} 
2{104_M732} 
2f 104__A909} 
2(104_M781> 
_JM9130013} 
Consensus 



maal78297.2(l04_090} 
msal78297.2(l04_18RS2l} 
msal78297.2{104_2603} 
msal78297 .2 ( 104_CJB110 } 



CCTGTTACTG 
CCTGTTACTG 
CCTGTTACTG 

CCTGTTACTG 
********** 

1401 

ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
ACTCTCTGTA 
********** 

1451 

ATCTCTATTG 
AT CTCT ATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 
ATCTCTATTG 



GAGGAACGAC 

GAGGAACGAC 

GAGGAACGAC 

GAGGAACGAC 
********** 



ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
ATGAGTAATG 
********** 



ACAAGCAGCT 

ACAAGCAGCT 

ACAAGCAGCT 

ACAAGCAGCT 
********** 



AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 

AGGGATATGC 
********** 



TATCGAGTAC 
TATCGAGTAC 
TATCGAGTAC 
TATCGAGTAC 
********** 



AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 

AATTAATAGT 
********** ********** 



CGCAAAATCA 

CGCAAAATCA 

CGCAAAATCA 

CGCAAAATCA 
********** 

r 

1450 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 
GGATATATTT 



AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
AACTGGGTCT 
********** ********** ********** 



GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 
GAGAGATTAC 



1501 

AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
AAGAAAGTTT 
********** 

1551 

ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
ATTATACTTT 
********** 

1601 

TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
TTGGGATTGG 
********** 

1651 

AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 
AAATTTATGC 

AAATTTATGC 
********** 



CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 

CTGCAACGAA 
********** 



AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
AATGGAAATA 
********** 



TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
TGTAAACGGA 
********** 



AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 
AATCAATATC 



ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
ACAAATCAAA 
********** 



ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 

ATCCATTTGA 
********** 



ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 

ACTCATGGTG 
********** 



TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
TAAGACCTAA 
********** 



GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 

GATCCTGGTG 
********** 



1500 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 
TCCTAAGACA 

TCCTAAGACA 
********** 

1550 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
AGCCAACAAC 
********** 

1600 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 

ATTTTTACTG 
********** 

1650 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
TGAAGCTGAG 
********** ********** 



AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
AGGTTATGAC 
********** 



CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 
CAACTCCTCT 



AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 
AAGTAAAACA 



GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 
GAAAATTATA 



1700 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 
CTAATGTTGA 



********** ********** ********** ********** 



1701 

TGATACAAAT AAAATTTATG ATGAGCTAAA 
TGATACAAAT AAAATTTATG ATGAGCTAAA 
TGATACAAAT AAAATTTATG ATGAGCTAAA 
TGATACAAAT AAAATTTATG ATGAGCTAAA 



1750 

TAAATACTTT AAAACAATTG 
TAAATACTTT AAAACAATTG 
TAAATACTTT AAAACAATTG 
TAAATACTTT AAAACAATTG 
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Table 41: Comparative Sequences relating to SAG0649 



msal782 97 .2f 104_COHl} 
msa!78297 . 2 { 104_M732 } 
msal78297 .2{l04_A909} 
msal78297 . 2{l04_M78l} 
msal78297 . 2 { 104_JM9130013 } 

Consensus 



msal78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 .2(104 



.2(l04_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{l04_COHl) 
2{104_M732) 
2{l04_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



msal78297 
msal78297 .2{ 
msal78297. 
msal78297 ,2{ 
msal78297 . 
rasal78297 . 
msal78297 . 
msal78297 . 
msal78297 .2(104 



.2{l04_090 
104__18RS21 
2(104_2603] 
104_CJB110; 
2(104_COH1 
2{104_M732 
2(l04_A909 
2(104_M781 
_JM9130013; 
Consensus 



msal78297 -2{l04_090} 
msal78297 .2(104 18RS21} 
msal7 8297 . 2 { 104__2603 } 
msal78297 . 2 { 104_CJB110 ) 
msal78297 . 2 ( 104_COH1 } 
msa!78297.2{l04_M732} 
msal78297.2(l04_A909} 
msal78297.2(l04_M78l} 
msal78297 . 2{l04_JM9130013 } 

Consensus 



msal78297 
msal78297.2( 
msal78297 . 
msal78297.2( 
msal78297 . 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297. 2(104 



.2{104_090] 
104_18RS2l} 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732; 
2(104 A909; 
2{l04~M78l' 
_JM9130013/ 
Consensus 



msal78 
msal78297 
msal782 
msal78297 
msal782 
msal782 
msal782 
msal782 
msal78297.2{ 



297 .2{104_090} 
.2{104_18RS21} 
97 .2(104_2603} 
.2(104_CJB110} 
97 .2(104_COH1) 
97 .2(104_M732} 
97 .2f 104_A909j 
97.2{104_M781} 
104_JM9130013} 
Consensus 



TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
TGATACAAAT 
********** 

1751 

TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 
TTGAGGAAAA 



msal78297 .2(104_090) 
msal78297 . 2 ( 104_18RS21 } 
msal78297 . 2 ( 104_2603 } 
msal78297 . 2 { 104_CJB110} 
tnsal78297 .2(l04_COHi; 
msal78297 .21 104_M732 
msal78297 .2(104_A909 
msal78297.2(l04_M781 
msal78297 . 2 ( 104_JM9130013 ; 

Consensus 



msal78297.2(l04_090} 
msal78297 . 2 ( 104__18RS21 J 
msal78297 .2{104_2603) 



AAAATTTATG 

AAAATTTATG 

AAAATTTATG 

AAAATTTATG 

AAAATTTATG 
********** 



ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
ACATTCTATT 
********** ********** 



ATGAGCTAAA 

ATGAGCTAAA 

ATGAGCTAAA 

ATGAGCTAAA 

ATGAGCTAAA 
********** 



GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
GTTGATGGAA 
********** 



TAAATACTTT 
TAAATACTTT 
TAAATACTTT 
TAAATACTTT 
TAAATACTTT 



AAAACAATTG 
AAAACAATTG 
AAAACAATTG 
AAAACAATTG 
AAAACAATTG 



********** ********** 



ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
ATGTGACTGA 
********** 



AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
AATTCCAATT 
********** 



1801 

GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 
GAGATGATTG 

GAGATGATTG 
********** 

1851 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 

TTACGTTTTG 
********** ********** 



AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
AAAAAATGGT 
********** 



GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 
GTTGGAAATG 



ATGGCAGTCA 
ATGG CAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 
ATGGCAGTCA 

ATGGCAGTCA 
********** 



CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
CAAAGTTTTA 
********** 



ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 

ATTAAAAAAT 
********** 



1901 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 

TTGGTGGACC 
********** 

1951 

TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
TATGATAAGA 
********** 

2001 

TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
TGGACAAAAA 
********** 



AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
AAACAGTGAT 
********** 



CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 

CATCTCAAAC 
********** 



TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 
TAAAAGATGT 



GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 

GGGGGAATTT 
********** ********** ********** 



1800 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 
TCCTATGGGA 

TCCTATGGGA 
********** 

1850 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 
CACATGATGA 

CACATGATGA 
********** 

1900 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 

GGTGTGGCTC 
********** 

1950 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 
TACAGTGACT 



CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 

CATCAAAATC 
********** 



AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 
AATCATTTGA 

AATCATTTGA 
********** 



GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 
GTAGTTCTTA 

GTAGTTCTTA 
********** 



CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
CCTATGATGT 
********** ********** 



ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 
ACGTTTAAAA 



2000 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 

ACTTAGGAAG 
********** 

2050 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 

GATAACTATA 
********** 



2051 2100 
TAAGTAACAA ATT TTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG 
TAAGTAACAA ATT TTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG 
TAAGTAACAA ATTTTACAAT ACAAATAATC GTACAACGCT AAGTCCGAAG 



742 
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Table 41: Comparative Sequences relating to SAG0649 



msal78297 . 2 { 104__CJB110 
msal78297 .2{104_COH1 
msal78297 . 2 { 104_M732 
msal78297 .2{l04_A909 
msal78297 .2{104_M781} 
msal78297 . 2 { 104_JM913 0013 } 

Consensus 



TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 
TAAGTAACAA 

TAAGTAACAA 
********** 



ATTTTACAAT 

ATTTTACAAT 

ATTTTACAAT 

ATTTTACAAT 

ATTTTACAAT 

ATTTTACAAT 
********** 



ACAAATAATC 

ACAAATAATC 

ACAAATAATC 

ACAAATAATC 

ACAAATAATC 

ACAAATAATC 
********** 



GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 
GTACAACGCT 



AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 
AAGTCCGAAG 



********** ********** 



msal78297 
msal78297 .2 { 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
rasal78297 . 
msal78297.2{l04 



-2{l04_090} 
104_18RS2l} 
2{104_2603} 
104_CJB110} 
2{104_COH1} 
2{104_M732} 
2{104_A909) 
2{104_M781} 
_JM9130013} 
Consensus 



2101 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 

AGTGAAAAAG 
********** 



AAC CARAT AC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 

AACCAAATAC 
********** 



TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 

TATTCGTGAT 
********** 



TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 

TTCCCAATTC 
********** 



2150 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 

CCAAAATTCG 
********** 



msal78297 .2{l04_090} 
msal78297 . 2 {l04_18RS2l} 
rasal78297 . 2 { 104_2603 } 
msal78297 .2{l04 CJB110} 
msa!782 97 . 2 { 1 04_COH1 } 
msal78297 . 2 { 104_M732 } 
msal78297 .2(104_A909} 
msal78297 .2{104_M78l} 
msal78297 . 2 { 104_JM9130013 } 

Consensus 



2151 

TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 
TGATGTTCGT 



GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 
GAGTTTCCGG 



TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 
TACTAACCAT 



CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAATCAg 
CAGTAAT CAa 



2200 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 
AAGAAAATGG 



********** ********** ********** *********_ ********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msa!78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090 
104__18RS21 
2{104_2603 
104_CJB110 
2{104_COH1 
2{104_M732 
2{104_A909 
2{104_M781} 
_JM9130013} 
Consensus 



2201 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 

GTGAGGTTGA 
********** 



ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 

ATTTATTAAA 
********** 



GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 

GTTAATAAAG 
********** 



ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 

ACAAACATTC 
********** 



2250 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 

AGAATCGCTT 
********** 



msal78297 
msal7B297.2{ 
msal78297 . 
msal78297.2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{l04 



.2{104_090} 
104_18RS2l} 
2{104_2603) 
104_CJB110} 
2{104_COH1} 
2{l04_M732} 
2{104_A909} 
2{l04_M78l} 
_JM9130013} 
Consensus 



2251 

TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 
TTGGGAGCTA 



AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 
AGTTTCAACT 



TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAgAA 
TCAGATAaAA 



AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 
AAAGATTTTT 



2300 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 
CTGGGTATAA 



********** ********** *******_** ********** ********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297 . 
msal78297.2{!04 



.2{l04_090 
104_18RS21 
2{104__2603 
104_CJB110} 
2{104_COH1} 
2{104_M732 \ 
2{104_A909} 
2{104_M781} 
_JM9130013} 
Consensus 



2301 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 

GCAATTTGTT 
********** 



CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 

CCAGAGGGAA 
********** 



GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 

GTGATGTTAC 
********** 



AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 

AACAAAGAAT 
********** 



2350 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 

GATGGTAAAA 
********** 



msal78297 
msal78297.2{ 
msal78297 . 
msal78297 .2{ 
msal78297 . 
msal78297 . 
msal78297 . 
m9al78297 . 
msa!78297.2{l04 



•2{l04_090} 
104_18RS21} 
2{104_2603} 
104_CJB110} 
2{104_COH1" 
2{104_M732 
2{104_A909' 
2{104_M781} 
_JM9130013} 
Consensus 



2351 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 

TTTATTTTAA 
********** 



AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 

AGCACTTCAA 
********** 



GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 
GATGGTAACT 



ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 
ATAAATTATA 



********** ********** 



2400 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 

TGAAATTTCA 
********** 



msal78297 .2{l04_090l 
msal78297 .2{104_18RS21} 



2401 2450 
AGTCCAGATG GCTATATAGA GGTTAAAACG AAACCTGTTG TGACATTTAC 
AGTCCAGATG GCTATATAGA GGTTAAAACG AAACCTGTTG TGACATTTAC 
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msal78297 . 2 { 104_2603 } 
msal78297 . 2 { 104_CJB110 } 
msal7 8 2 9 7 . 2 { 1 0 4_COHl } 
msal78297 .2{104_M732} 
m3al78297 . 2 ( 104_A909 } 
msal78297 .2{104_M781} 
msal78297 . 2 { 104_JM9130013 } 

Consensus 



msal78297 .2{104_090 
msal78297 . 2 { 104_18RS21 
msal78297 . 2 { 104_2603 
tnsa!78297 . 2 { 104_CJB110 
ms al 7 8 2 9 7 . 2 { 1 0 4_COHl 
msa!78297 . 2 { 104_M732 ] 
. msal78297 .2{104_A909 
msal78297 . 2 { 104_M781 / 
msal78297.2{l04_JM9130013; 

Consensus 



msa!78297 
msal78297 .2{ 
msal78297 . 
msal78297 .2{ 
msal782 97 . 
msa!78297 . 
msa!78297 . 
msal78297 . 
msa!78297 .2 {104 



.2{104_090; 
104_18RS21; 
2{l04_2603 
104_CJB110 
2{104_COH1 
2(104_M732 
2{104_A909* 
2{104_M781 
_JM9130013 
Consensus 



msal78297.2{l04_090 
msal78297 . 2 { 104_JL8RS21 ' 
msal78297 . 2 { 104_2603 
msal78297 . 2 { 104_CJB110 
msaX78297.2{l04_COHl 
msal78297 . 2 { 104_M732 * 
msal782 97 - 2 { 104_A909 
msal78297.2{l04_M78i; 
msal78297 -2{L04_JM9130013; 

Consensus 



msal78297 .2{104_090; 
msal78297 .2 {104_18RS21 
msal78297.2{l04_2603 
msal78297 - 2 { 104_CJB110 
msal78297 . 2 { 104_COH1 
rasal78297.2{l04_M732 
msal78297.2{l04_A90 9 
msa!78297.2{l04_M781 
msal78297 . 2 { 104_JM9130013 ; 

Consensus 



GCTATATAGA 
GCTATATAGA 
GCTATATAGA 
GCTATATAGA 
GCTATATAGA 
GCTATATAGA 
GCTATATAGA 



AGTCCAGATG 

AGTCCAGATG 

AGTCCAGATG 

AGTCCAGATG 

AGTCCAGATG 

AGTCCAGATG 

AGTCCAGATG 
********** 

2451 

AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 
AATTCAAAAT 

AATTCAAAAT 
********** 

2501 

AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 
AAAATCAAAT 

AAAATCAAAT 
********** ********** 



GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 
GGTTAAAACG 



AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 
AAACCTGTTG 



TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 
TGACATTTAC 



********** ********** ********** ********** 



GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
GGAGAAGTTA 
********** 



CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 
CGGGTATCTT 



CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 

CGAACCTGAA 
********** 



GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 
GAAggaaatg 



AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 

AGCAGATCCA 
********** 



gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 
gtaaacatct 



2500 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 

AATGCTAATA 
********** 

2550 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 
tattaccaac 



GAA- 
***. 



2600 



2551 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtttttcct aaaacagggg gaattggtac 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 

actcccaaac gcccaccagg tgtt 



****** ********** ********** 



2601 



2650 



aattgtctat atattagttg gttctacttt tatgatactt accatttgtt 



********** ********** ********** ********** ********** 



msa!78297.2{l04_090 
msal78297 . 2 { 104_18RS2i; 

msal78297 .2{104_2603 
msal78297 .2{104_CJB110 
msal78297.2{l04_COHl 
msal78297 .2{104_M732 
msal78297 ,2{104_A909; 
msal78297 .2{104_M781 
rasal78297 . 2 { 104_JM9130013 

Consensus 



2651 



2670 



ctttccgtcg taaacaattg 



********** ********** 



744 



WO 2004/018646 



PCT/US2003/026827 



Table 41: Comparative Sequences relating to SAG0649 



SEQ ID NO. 4110: SAG0649 FROM 2603 V/R GBS TYPE V STRAIN 
MKKRQKI WRGLSVTLLI LSQ I PFG I LVQGETQDTNQALGKVI VKKTGDNATPLGKATFVL 
KNDNDKSETSHETVEGSGEATFENI KPGDYTLREETAPIGYKKTDKTWKVKVADNGATI I 
EGMDADKAEKRKEVLNAQY PKSAI YEDTKENYPLVNVEGSKVGEQYKALNP I NGKDGRRE 
I AEGWLSKKI TGVNDLD KNKYKI ELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERAN 
NSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST I FDGTEATVSKGVADQNGKALNDSV 
SWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI PKEAEHINGDRTLYQFGATFTQKAL 
MKANEILETQSSNARKKLI FHVTDGVPTMSYAINFNPYISTSYQNQFNSFLNKI PDRSGI 
LQEDFIINGDDYQIVKGDGESFKLFSDRKVPVTGGTTQAAYRVPQNQLSVMSNEGYAINS 
G Y I YL YWRDYNWVYP FDPKTKKVS ATKQ I KTHGE PTTLYFNGN I RPKGYD I FTVG I GVNG 
DPGATPLEAEKFMQS I SSKTENYTNVDDTNKI YDELNKYFKT I VEEKHS I VDGNVTDPMG 
EM I EFQLKNGQS FTHDD YVL VGNDG SQLKNGVALGGPNSDGG ILKDVTVTYDKTSQTI KI 
NHLNLGSGQKVVLT YDVRLKDNYI SNKFYNTNNRTTLSPKSEKE PNT I RDFP I PKI RDVR 
EFPVLTI SNQKKMGEVEFI KVNKDKHSESLLGAKFQLQI EKDFSGYKQFVPEGSDVTTKN 
DGKI YFKALQDGNYKLYE I S SPDG Y I E VKTKP WT FT I QNGEVTNLKADPNANKNQI GYL 
EGNGKHL I TNTP KRPPGVF PKTGG IGTIVYI LVGSTFM I LT I CSFRRKQL 

SEQ ID NO. 4111: SAG0649 FROM 090 GBS TYPE la STRAIN 

GETQDTNQALGKVI VKKTGDNATPLGKATFVL KNDNDKSETSHETVEGSGEATFENI KPG 
DYTLREETAPIGYKKTDKTWKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGS KVGEQ YKALNP I NGKDGRREI AEGWLSKKI TGVNDLDKNKYKI ELTVE 
GKTTVETKELNQPLD VVVLLDNSNSMNNERANNS Q RAL KAGEAVE KL I DKITSNKDNRVA 
LVTYASTIFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANE 
NILKSRI PKEAEHI NGDRTLYQFGATFTQKALMKANE I LETQS SNARKKL I FHVTDG VPT 
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEGYA I NSGY I YLYWRDYNWVYP FDPKTKKVS ATKQ. 
I KTHGEPTTLYFNGNI RPKGYD I FTVG I GVNGDPGAT PLEAEKFMQS I S S KTENYTNVDD 
TNKI YDELNKYFKT I VEEKHS I VDGNVTD PMGEM I EFQLKNGQS FTHDDYVLVGNDGSQL 
KNGVALGGPNSDGG I LKDVTVT YDKTSQT I KINHLNLGSGQKVVLTYDVRLKDNYI SNKF 
YNTNNRTTLS PKSE KEPNT I RD FP I PKI RD VREFP VLTI SNQKKMGEVEF I KVNKDKHS E 
SLI/3AKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKIYFKALQDGNYKLYEISSPDGYIEV 
KTKPWTFTI QNGEVTNLKADPNANKNQI GYLEGNGKHL I TNTP KRPPGV 

SEQ ID NO. 4112: SAGO 649 FROM A909 GBS TYPE la STRAIN 

GETQDTNQALGKVI VKKTGDNATPLGKATFVL KNDNDKSETSHETVEGSGEATFENI KPG 
DYTLREETAPIGYKKTDKTWKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGS KVGEQYKALNP I NGKDGRRE I AEGWLSKKITGVNDLD KNKYKI ELTVE 
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNS QRAL KAGEAVE KL IDKI TSNKDNRVA 
LVTYAST I FDGTEATVS KG VAD QNG KALND S VS WD YH KTT FTATTHNYS YLNLTNDANE V 
NI LKSRI PKEAEHINGDRTLYQFGATFTQKALMKANEI LETQSSNARKKLI FHVTDGVPT 
MSYAINFNPYISTSYQNQFNSFLNKI PDRSG I LQEDF 1 1 NGDDYQ I VKGDGES FKLFSDR 
KVPVTGGTTQAAYRVPQNQLS VMSNEG YA INSG Y I YLYWRDYNWVYP FDPKTKKVSATKQ 
I KTHGEPTTLYFNGNI RPKGYD I FTVG IGVNGDPGATPLEAEKFMQS I SSKTENYTNVDD 
TNKI YDEI^KYFKT I VEEKHS I VDGNVTD PMGEM I EFQLKNGQS FTHDDYVLVGNDGSQL 
KNGVAIiGGPNSDGGILKDVTVTYDKTSQTI KI NHLNLGSGQKWLTYDVRLKDNYI SNKF 
YNTNNRTTLSPKSEKEPNTIRDFPIPKIRDWEFPVLTISNQKKMGEVEFIKVNKDKHSE 
SLLGAKFQLQ I E KDFSG YKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I S SPDGY I E V 
KTKPWTFTI QNGEVTNLKADPNANKNQI GYLEGNGKHL ITNTPKRPPGV 

SEQ ID NO. 4113: SAG0649 FROM 18RS21 GBS TYPE II STRAIN 

GETQDTNQALGKVI VKKTGDNATPLGKATFVL KNDNDKSETSHETVEGSGEATFENI KPG 
DYTLREETAPIGYKKTDKTWKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENY PLVNVEGS KVGEQYKALNP I NGKDGRRE I AEGWLSKKI TGVNDLDKNKYKI ELTVE 
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNS QRALKAGEAVE KL IDKITSNKDNRVA 
LVTYAST I FDGTEATVSKG VADQNGKALNDS VSWDYHKTTFTATTHNYS YLNLTNDANE V 
NI LKSRI PKEAEHINGDRTLYQFGATFTQKALMKANEI LETQSSNARKKLI FHVTDGVPT 
MSYAINFNPYISTSYQNQFNSFLNKI PDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEGYA INSGY I YLYWRDYNWVYP FDPKTKKVS ATKQ 
I KTHGEPTTLYFNGNI RPKGYD I FTVG IGVNGDPGATPLEAEKFMQS I SSKTENYTNVDD 
TNKI YDELNKYFKT I VEEKHS I VDGNVTD PMGEM I EFQLKNGQS FTHDD YVL VGNDGSQL 
KNGVALGGPNSDGG I LKDVTVT YDKTSQT I KINHLNLGSGQKVVLTYDVRLKDNYI SNKF 
YNTNNRTTLS PKSE KEPNT I RDFP I PKI RD VREFP VLT I SNQKKMGEVEFI KVNKDKHSE 
SLLGAKFQLQ I E KDFSG YKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I S S PDGYI E V 
KTKPWTFT I QNGEVTNLKADPNANKNQIGYLEGNGKHL ITNTPKRPPGV 

SEQ ID NO. 4114: SAG0649 FROM M732 GBS TYPE III STRAIN 

GETQDTNQALGKVI VKKTGDNATPLGKATFVL KNDNDKSETSHETVEGSGEATFENI KPG 
DYTLREETAP I GYKKTDKTWKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGSKVGEQYKALNP I NGKDGRRE I AEGWLSKKNTGVNDLD KNKYKI ELTVE 
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA 
LVTYAST I FDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEV 
NILKSRI PKEAEHI NGDRTLYQFGATFTQKALMKANE I LETQS SNARKKL I FHVTDGVPT 
MSYAINFNPYISTSYQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ 
I KTHGEPTTL YFNGN I RPKGYDI FTVG I GVNGDPGAT PLEAEKFMQS I S S KTENYTNVDD 
TNKI YDELNKYFKT I VEEKHS I VDGNVTDPMGEM I EFQLKNGQS FTHDDYVLVGNDGSQL 
KNGVALGGPNSDGG I LKDVTVT YDKTSQT I KINHLNLGSGQKVVLTYDVRLKDNYI SNKF 
YNTNNRTTLS PKSEKEPNT I RDFPI PKIRDVREFPVLT I SNQKKMGEVEFI KVNKDKHSE 
SLLGAKFQLQ I EKDFSGYKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I SSPDGYI EV 
KTKPWTFT I QNGEVTNLKADPNANKNQI GYLEGNGKHL ITNTPKRPPGV 

SEQ ID NO. 4115: SAG0649 FROM COH1 GBS TYPE III STRAIN 

GETQDTNQALGKV I VKKTGDNATPLGKAT F VLKNDNDKS ETSHET VEGSGXATFENI KPG 
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DYTLREETAP I GYKKTDKT WKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGSKVGEQYKALNPINGKDGRREI 

GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA 
LVTYAST I FDGTEATVS KGVADQNGKALNDS VSWD YHKTTFTATTHNYS YLNLTNDANEV 
NI LKSRI PKEAEH I NGDRTLYQFGATFTQKALMKANE I LETQS SNARKKL I PHVTDGVPT 
MS YA I NFNP Y I STS YQNQFNSFLNKI PDRSGI LQEDF 1 1 NGDD YQ I VKGDGES FKLFSDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQ 
I KTHGEPTTLYFNGNI RPKGYD I FTVG IGVNGDPGATPLEAEKFMQS I SSKTENYTNVDD 
TNKI YDELNKYFKT I VEEKHS I VDGNVTDPMGEM I EFQLKNGQS FTHDDYVLVGNDGSQL 
KNGVALGGPNSDGGI LKDVTVTYDKTSQTI KINHLNLGSGQKVVLTYDVRLKDNYI SNKF 
YNTNNRTTLSPKSEKEPNTIRDFP I PKIRDVREFPVLTI SNQKKMGEVEF IKVNKDKHSE 
SLLGAKFQLQ I EKDFSGYKQFVPEGSDVTTKNDGK I YFKALQDGNYKLYE J SSPDGY I EV 

KTKPWTFTI QNGE VTNLKADPNANKNQ I GYLEGNGKHL I TNTPKRP PG V 

SEQ ID NO. 4115: SAG0649 FROM M781 GBS TYPE III STRAIN 

GKVIVKKTGDTATPLGKATFVLKNDNDKSETSHETVEGSGKATFENIKPGDYTLREETAP 
IGYKKTDKTWKVKVADNGAXI I EGMDADKAEKRKEVLNAQYPKSAIYEDTKENYPLVNVE 
GSKVGEQYKALNPI NGKDGRRE I AEGWLSKKITGVNDLDKNKYKI ELTVEGKTTVETKEL 
NQPLDVWLLDNSNSMNNERANNSQRALKAGEAVEKLI DKITSNKDNRVALVTYASTI FD 
GTEATVSKGVADQNGKAI^SVSWDraKTTFTATTHNYSYI^LT^ANEWILKSRIPKE 
AEHI NGDRTLYQFGATFTQKALMKANE I LETQS SNARKKL I FHWDGVPTMSYA I NFNP Y 
I STS YQNQFNSFLNKI PDRSGI LQEDF I INGDDYQ I VKGDGESFKLFSDRKVPVTGGTTQ 
AAYRVPQNQLSVMSNEGYAINSGYI YLYWRDYNWVYPFDPKTKKVSATKQI KTHGEPTTL 
YFNGN I RPKGYD I FTVG I GVNGDPGATPLEAEKFMQS I S S KTENYTNVDDTNKI YDELNK 
YFKTI VEEKHS I VDGNVTDPMGEMI EFQLKNGQS FTHDDYVLVGNDGSQLKNGVALGGPN 
SDGG I LKDVTVTYDKTSQT I KI NHLNLGSGQKWLTYDVRLKDNY I SNKFYNTNNRTTLS 
PKSEKEPNT I RDFPI PKIRDVREFPVLTI SNQKKMGEVEF IKVNKDKHSESLLGAKFQLQ 
I EKDFSG YKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I S S PDGY I EVKTKP WTFT I 
QNGE VTNLKADPNANKNQ I GYLEGNGKHL I TNTPKRP PGV 

SEQ ID NO. 4117: SAG0649 FROM CJB110 GBS NONTYP EABLE STRAIN 

GETQDTNQALGKVI VKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGXATFENI KPG 
DYTLREETAP I GYKKTDKTWKVKVADNGAT 1 1 EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGS KVGEQYKALNP I NGKDGRRE I AEGWLS KKI TGVNDLDKNKYKI ELTVE 
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVA 
L VTYAS T I FDGTEATVS KG VAD QNG KALND S VS WD YH KTT FTATTHNY S YLNLTNDANE V 
NI LKSRI PKEAEHINGDRTLYQFGATFTQKALMKANE ILETQSSNARKKLI FHVTDGVPT 
MS YAINFNPYI STSYQNQFNSFLNKI PDRSGI LQEDFI INGDDYQ I VKGDGESFKLFSDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEGYAI NSGY I YLYWRD YNWVYP FD PKTKKVSATKQ 
IKTHGEPTTLYFNGNIRPKGYDIFTVGIGWGDPGATPLEAEKFMQS I SSKTENYTNVDD 
TNKIYDELNKYFKTIVEEKHSIVDGNVTDPMGEMIEFQLKNGQSFTHDDYVLVGNDGSQL 
KNGVALGGPNSDGGI LKDVTVTYDKTSQTI KI NHLNLGSGQKWLTYDVRLKDNY I SNKF 
YNTNNRTTLS PKSEKEPNT I RDFP I PKIRDVRE F P VLT I SNQKKMGE VE F I KVNKDKHS E 
SLLGAKFQLQIEKDFSGYKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I SSPDGYI EV 
KTKP WT FT I QNGE VTNLKADPNANKNQI GYLEGNGKHL I TNTPKRPPG V 

SEQ ID NO. 4118: SAG0649 FROM JM9130013 GBS TYPE VIII STRAIN 

GETQDTNQALGKVIVKKTGDNATPLGKATFVLKNDNDKSETSHETVEGSGEATFENIKPG 
DYTLREETAP IGYKKTDKTWKVKVADNGATI I EGMDADKAEKRKEVLNAQYPKSAI YEDT 
KENYPLVNVEGSKVGEQYKAIjNPINGKDGRREIAEGWLSKKITGVNDIjDKNKYKIELT^ 
GKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKD 
LVTYAST I FDGTEAWSKGVADQNGKALNTOSVSWDYHKTTFTATTHNYSYLNLTNDANEV 
NI LKSRI PKEAEHINGDRTLYQFGATFTQKALMKANE ILETQSSNARKKLI FHVTDGVPT 
MS YA I NFNP Y I STS YQNQFNS FLNKI PDRSG I LQEDF 1 1 NGDD YQ I VKGDGESFKLF SDR 
KVPVTGGTTQAAYRVPQNQLSVMSNEG YA I NSGY I YLYWRD YNWVYP FDPKTKKVSATKQ 
I KTHGEPTTL YFNGN I RPKGYD I FTVG IGVNGDPGATPLEAEKFMQS I SSKTENYTNVDD 
TNKI YDELNKYFKT I VEEKHS I VDGNVTDPMGEM I EFQLKNGQSFTHDDYVLVGNDGSQL 
KNGVALGGPNSDGGI LKDVTVTYDKTS QT I KI NHLNLGSGQKWLTYDVRLKDNY I SNKF 
YNTNNRTTLS PKSEKEPNT I RD FP I PKI RDVREF PVLT I SNQKKMGEVE F I KVNKDKHS E 
SLLGAKFQLQIKKDFSGYKQFVPEGSDVTTKNDGKI YFKALQDGNYKLYE I SSPDGYI EV 
KTKPWTFTI QNGE VTNLKADPNANKNQI GYLE 
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PRETTY of: /biotmp/msal78371 . 2 {*} May 12, 2003 09:25 



msal78371.2{l04_CJB110 
msal78371 . 2 f 104_M781 
msa!78371 . 2 { 104_COH1 
rasal78371 . 2 { 104_M732 
msal78371 . 2 { 104_090 
msal78371 . 2 { 104_18RS21 
msal78371.2{l04__2603' 
msal78371.2{l04_A909; 
msal78371.2{l04_JM9130013} 

Consensus 



msal78371 .2{ 
msal78371 . 
msa!78371 . 
msal78371 . 
msal78371 
msal78371 ,2{ 
msal78371. 
msal78371 
msal78371.2{!04 



104_CJB110; 
2{l04_M781 
2{104_COH1 
2{l04_M732/ 
.2{104_090; 
104_18RS2l) 
2{l04_2603} 
2{104_A909} 
_jJM9130013} 
Consensus 



msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371. 
msal78371 
rasal78371.2{ 
msal78371 . 
msal78371. 
msal78371.2{!04 



104_CJB110} 
2{104_M781} 
2{104_COH1} 
2{104JM732} 
,2{104_090} 
104_18RS2l} 
2{104_2603} 
2{104_A909} 
: _JM9130013} 
Consensus 



1 50 
ge tqdtnqalGK V I VKKTGDnA 

GK VIVKKTGDtA 

-ge tqdtnqalGK VI VKKTGDnA 

ge tqdtnqalGK VI VKKTGDnA 

ge tqdtnqalGK VI VKKTGDnA 

ge tqdtnqalGK VI VKKTGDnA 

mkkrqkiwrg lsvtllilsq ipfgilvqge tqdtnqalGK VI VKKTGDnA 

ge tqdtnqalGK VI VKKTGDnA 

ge tqdtnqalGK VI VKKTGDnA 

********** ********** ******** — ** ********-* 



msal78371 .2 { 104_CJB110 } 
msal78371.2{l04_M78l} 
tnsal78371 . 2 { 104_COH1 } 
msal78371.2{l04_M732) 
msal78371.2{l04_090} 
msal78371.2{l04_18RS2l} 
msa!78371 - 2 { 104_2603 } 
msal78371.2{l04_A909> 
msal78371.2{l04_JM913 0013} 

Consensus 



msal78371 . 2 { 104_CJB110 
msal78371.2{l04_M781 
msal78371.2{l04_COHl 
msal78371.2{!04_M732l 
msal78371 . 2 {104_090 } 
msal78371.2{l04_18RS21 
msal78371.2{!04_2603 
msal78371 . 2 { 104_A909 
msal78371.2{l04_JM9130013, 

Consensus 



msal78371 . 2 { 104_CJB110} 
msal78371.2{l04_M78l} 
msal78371 . 2 (104_COH1* 
msal78371.2{104_M732' 
msal78371. 2 { 104^090 ; 
msal78371.2{l04_18RS2l} 
msal783 71.2{!04_2603* 
msal78371.2{l04_A909' 
msal78371.2{l04_JM9130013; 

Consensus 



msal78371.2{ 

msal78371. 

msal78371. 

msa!78371. 
msal78371 
msal7837l.2{ 

msal78371. 

msal78371. 



104_CJB110} 

2{104 M78i; 

2{104~COH1 

2{104__M732 

.2{104_090} 

104_18RS2l} 

2{104_2603) 

2{104_A909} 



KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 
KNDNDKSETS 

KNDNDKSETS 
********** 



KVADNGAt I I 
KVADNGAxI I 
KVADNGAt I I 
KVADNGAt I I 
KVADNGAt I I 
KVADNGAt I I 
KVADNGAt I I 
KVADNGAt I I 
KVADNGAt I I 



HETVEGSGxA 
HETVEGSGkA 
HETVEGSGxA 
HETVEGSGeA 
HETVEGSGeA 
HETVEGSGeA 
HETVEGSGeA 
HETVEGSGeA 
HETVEGSGeA 



TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 
TFENIKPGDY 



********_* ********** ********** 



51 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 

TPLGKATFVL 
********** 

101 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 

YKKTDKTWKV 
********** 

151 

NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 
NYPLVNVEGS 

********** ********** ********** *****★★**- 



100 

TLREETAP I G 
TLREETAPIG 
TLREETAP I G 
TLREETAPIG 
TLREETAPIG 
TLREETAPIG 
TLREETAPIG 
TLREETAPIG 
TLREETAPIG 



EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 
EGMDADKAEK 



RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 
RKEVLNAQYP 



150 

KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 
KSAIYEDTKE 



*******_** ********** ********** ********** 



KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 
KVGEQYKALN 



PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 
PINGKDGRRE 



lAEGWLSKKi 
IAEGWLSKKi 
IAEGWLSKKn 
lAEGWLSKKn 
IAEGWLSKKi 
IAEGWLSKKi 
lAEGWLSKKi 
lAEGWLSKKi 
IAEGWLSKKi 



200 

TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
TGVNDLDKNK 
********** 



201 

YKI ELTVEGK 
YKIELTVEGK 
YKI ELTVEGK 
YKIELTVEGK 
YKIELTVEGK 
YKIELTVEGK 
YKIELTVEGK 
YKIELTVEGK 
YKIELTVEGK 



TTVETKELNQ 
TTVETKELNQ 
TTVETKELNQ 
TTVETKELNQ 
TTVETKELNQ 
TTVETKELNQ 
TTVETKELNQ 
TT VE TKELNQ 
TTVETKELNQ 



PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 
PLDVWLLDN 



SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 
SNSMNNERAN 



250 

NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 
NSQRALKAGE 



********** ********** ********** ********** ********** 



251 

AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 
AVEKLIDKIT 



SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 
SNKDNRVALV 



********** ********** 



TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
TYASTIFDGT 
********** 



EAT VS KGVAD 
EATVSKGVAD 
EAT VS KGVAD 
EATVSKGVAD 
EATVSKGVAD 
EATVSKGVAD 
EATVSKGVAD 
EATVSKGVAD 
EATVSKGVAD 



300 

QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 
QNGKALNDSV 



********** ********** 



301 

SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 
SWDYHKTTFT 



ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 
ATTHNYSYLN 



LTNDANE VN I 
LTNDANEVNI 
LTNDANEVNI 
LTNDANEVNI 
LTNDANEVNI 
LTNDANEVNI 
LTNDANEVNI 
LTNDANEVNI 



LKSRIPKEAE 
LKSRI PKEAE 
LKSRIPKEAE 
LKSRIPKEAE 
LKSRIPKEAE 
LKSRIPKEAE 
LKSRIPKEAE 
LKSRIPKEAE 



350 

HI NGDRTLYQ 
HINGDRTLYQ 
HI NGDRTLYQ 
HINGDRTLYQ 
HINGDRTLYQ 
HINGDRTLYQ 
HINGDRTLYQ 
HINGDRTLYQ 
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Table 41: Comparative Sequences relating to SAG0649 



msal78371 . 2 { 104_0*M9130013 } 

Consensus 



SWDYHKTTFT ATTHNYSYLN LTNDANEVNI LKSRIPKEAE HINGDRTLYQ 
********** ********** ********** ********** ********** 



msal78371.2{ 
msa!78371 . 
msal78371. 
msal78371 . 
msal78371 
rnsal78371.2{ 
msal78371 . 
msal78371. 
msal78371.2{!04 



104_CJB110) 
2{104_M781} 
2{104_COH1} 
2{l04_M732} 
.2{l04_090} 
104_18RS2l} 
2{104_2603} 
2{l04_A909} 
_JM9130013} 
Consensus 



351 

FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 
FGATFTQKAL 



MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 
MKANEILETQ 



********** ********** 



SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 

SSNARKKLIF 
********** 



HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 
HVTDGVPTMS 



400 

YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 
YAINFNPYIS 



********** ********** 



msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371 . 
msa!78371 
msal78371 .2 { 
msal7837l. 
msal78371 . 
rasal78371 .2{104 



104_CJB110 
2{104_M781 
2{104_COH1 
2{l04_M732 
.2{l04_090 
104_18RS21 
2{104_2603; 
2{104_A909} 
_JM9130013} 
Consensus 



401 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 

TSYQNQFNSF 
********** 



LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 

LNKIPDRSGI 
********** 



LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 

LQEDFIINGD 
********** 



DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 

DYQIVKGDGE 
********** 



450 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 

SFKLFSDRKV 
********** 



msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371 . 
msal78371 
msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371 .2(104 



104_CJB110' 
2{104_M781 
2{l04_COHl~ 
2{104_M732 
.2{104_090 
104_18RS21~ 
2{104_2603 
2{104_A909 
_JM9130013J 
Consensus 



451 

PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 
PVTGGTTQAA 



YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 
YRVPQNQLSV 



********** ********** 



MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 

MSNEGYAINS 
********** 



GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
GYIYLYWRDY 
********** 



500 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 

NWVYPFDPKT 
********** 



msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371 . 
msal78371 
msal78371 .2{ 
msal78371 . 
msal78371 . 
msal78371.2{!04 



104_CJB110 
2{104_M781 
2{104_COH1 
2{104_M732 
.2{104_090 
104_18RS21 
2{104_2603 
2{104_A909 
_JM9130013 
Consensus 



501 

KKVSATKQIK 

KKVSATKQI K 

KKVSATKQIK 

KKVSATKQIK 

KKVSATKQIK 

KKVSATKQIK 

KKVSATKQIK 

KKVSATKQIK 

KKVSATKQIK 
********** 



THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 

THGEPTTLYF 
********** 



NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 
NGNIRPKGYD 



IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 
IFTVGIGVNG 



********** ********** 



550 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 

DPGATPLEAE 
********** 



rnsal78371.2{ 
msal78371 . 
msal78371 . 
msal78371 . 
msal78371 
msal78371.2{ 
msal78371 . 
msal78371. 
msal78371.2{l04 



104_CJB110 
2{104_M781 
2{104_COH1 
2{104_M732; 
.2{l04_090 
104_18RS21 
2{l04_2603 
2{104_A909 
_JM9130013] 
Consensus 



551 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 

KFMQSISSKT 
********** 



ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 
ENYTNVDDTN 

ENYTNVDDTN 
********** 



KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 

KIYDELNKYF 
********** 



KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 

KTIVEEKHSI 
********** 



600 

VDGNVTD PMG 

VDGNVTD PMG 

VDGNVTD PMG 

VDGNVTD PMG 

VDGNVTD PMG 

VDGNVTDPMG 

VDGNVTDPMG 

VDGNVTDPMG 

VDGNVTDPMG 
********** 



msal78371 .2{ 
msal78371, 
msa!78371, 
msa!78371 . 
msal78371 
msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371.2{l04 



104_CJB110 
2{l04__M78l' 
2{104_COH1 
2{104_M732 
.2{104_090 
104_18RS21 
2{104_2603 
2{104_A909' 
_JM9130013 
Consensus 



601 

EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 
EMIEFQLKNG 



QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 
QSFTHDDYVL 



VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 
VGNDGSQLKN 



GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 
GVALGGPNSD 



********** ********** ********** ********** 



650 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 

GGILKDVTVT 
********** 



msal78371 . 2 { 104_CJB110 } 
msal78371.2f 104_M78l} 
msal78371.2{l04_COHl} 
msa!78371 . 2 { 104_M732 } 
msal78371.2{l04_090} 

msal78371 . 2 { 104_18RS21 ) 
msa!78371 . 2{104_2603} 



651 

YDKTSQTIKI 
YDKTSQTIKI 
YDKTSQTIKI 
YDKTSQTIKI 
YDKTSQTIKI 
YDKTSQTIKI 
YDKTSQTIKI 



NHLNLGSGQK 
NHLNLGSGQK 
NHLNLGSGQK 
NHLNLGSGQK 
NHLNLGSGQK 
NHLNLGSGQK 
NHLNLGSGQK 



WLTYDVRLK 
WLTYDVRLK 
WLTYDVRLK 
WLTYDVRLK 
WLTYDVRLK 
WLTYDVRLK 
WLTYDVRLK 



DNYISNKFYN 
DNYI SNKFYN 
DNYISNKFYN 
DNYISNKFYN 
DNYISNKFYN 
DNYISNKFYN 
DNYISNKFYN 



700 

TNNRTTLSPK 
TNNRTTLSPK 
TNNRTTLSPK 
TNNRTTLSPK 
TNNRTTLSPK 
TNNRTTLSPK 
TNNRTTLSPK 
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Table 41: Comparative Sequences relating to SAG0649 



msal78371.2{l04_A909} YDKTSQTIKI NHLNLGSGQK WLTYDVRLK DNYISNKFYN TNNRTTLSPK 
msal78371.2{l04_JM9130013} YDKTSQTIKI NHLNLGSGQK WLTYDVRLK DNYISNKFYN TNNRTTLSPK 

Consensus ********** ********** ********** ********** ********** 



msal78371.2{ 
msal78371 . 
msa!78371 . 
msa!78371 . 
msa!7837l 
msal78371.2{ 
msal78371 . 
msal78371 . 
msal78371.2{l04 



104J2JB110} 
2{104_M781} 
2{104_COH1} 
2{104_M732} 
.2{l04_090} 
104_18RS2l} 
2{104_2603} 
2{l04_A909} 
_JM9130013} 
Consensus 



701 

SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 
SEKEPNTIRD 



FPIPKIRDVR 
PPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 
FPIPKIRDVR 



EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 
EFPVLTISNQ 



********** ********** ********** 



KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 

KKMGEVEFIK 
********** 



750 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 

VNKDKHSESL 
********** 



msal78371.2{ 
msal78371 . 
msal78371 . 
msa!78371 . 
msal78371 
msal78371 .2{ 
msal78371 . 
msal78371 . 
msa!78371 ,2{104 



104_CJB110} 
2(104_M7S1} 
2?104_COH1} 
2{104_M732} 
.2{104_090} 
104_18RS2l} 
2{104_2603} 
2{104_A909} 
_JM9130013} 
Consensus 



751 

LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIe 
LGAKFQLQIk 



KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 
KDFSGYKQFV 



*********„ ********** 



PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 

PEGSDVTTKN 
********** 



DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 

DGKIYFKALQ 
********** 



800 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 

DGNYKLYEIS 
********** 



msal78371.2{ 
msa!78371 . 
msal78371 . 
msal78371. 
msal78371 
msal78371.2{ 
tnsal78371 . 
msal78371 . 
msa!78371 ,2{104 



104_CJB110} 
2{104_M781} 
2{104_C0H1} 
2{104_M732} 
.2{104_090} 
104_18RS2l) 
2fl04_2603} 
2{104_A909} 
_JM9130013} 
Consensus 



801 

SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 
SPDGYIEVKT 



KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 
KPWTFTIQN 



GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 
GEVTNLKADP 



NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 
NANKNQIGYL 



********** ********** ********** ********** 



msal78371.2{ 
rasal78371. 
msa!78371. 
msal78371 . 
tnsal78371 
msal78371.2{ 
n\sal78371 . 
rasa!78371 . 
msal78371.2{l04 
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104_CJB110} tpkrppgv — 

2{104_M781} tpkrppgv — 

2{104_COH1} tpkrppgv 

2{104_M732} tpkrppgv- 

.2{104_090} tpkrppgv — — 

104_18RS21 } tpkrppgv — 

2{l04_2603} tpkrppgvfp ktggigtivy ilvgstfmil ticsfrrkql 

2 { 104_A909 } tpkrppgv — 

_JM9130013} 

Consensus ** ********** ********** ********** 
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Egngkhlitn 

Egngkhlitn 

Egngkhlitn 

Egngkhl i tn 

Egngkhlitn 

Egngkhlitn 

Egngkhlitn 

Egngkhlitn 

E 

* 
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